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Abstract Coronaviruses infect many species of animals including humans, 

causing acute and chronic diseases. This review focuses primarily 
on the pathogenesis of murine coronavirus mouse hepatitis virus 
(MHV) and severe acute respiratory coronavirus (SARS-CoV). MHV 
is a collection of strains, which provide models systems for the 
study of viral tropism and pathogenesis in several organs systems, 
including the central nervous system, the liver, and the lung, and 
has been cited as providing one of the few animal models for the 
study of chronic demyelinating diseases such as multiple sclerosis. 
SARS-CoV emerged in the human population in China in 2002, 
causing a worldwide epidemic with severe morbidity and high 
mortality rates, particularly in older individuals. We review the 
pathogenesis of both viruses and the several reverse genetics 
systems that made much of these studies possible. We also review 
the functions of coronavirus proteins, structural, enzymatic, and 
accessory, with an emphasis on roles in pathogenesis. Structural 
proteins in addition to their roles in virion structure and morpho¬ 
genesis also contribute significantly to viral spread in vivo and in 
antagonizing host cell responses. Nonstructural proteins include 
the small accessory proteins that are not at all conserved between 
MHV and SARS-CoV and the 16 conserved proteins encoded in the 
replicase locus, many of which have enzymatic activities in RNA 
metabolism or protein processing in addition to functions in antag¬ 
onizing host response. 


I. INTRODUCTION 

Coronaviruses, a family of viruses within the Nidovirus superfamily, 
were divided into three groups (1, 2, 3), originally based on antigenic 
reactivity, later confirmed by genome sequencing. Recently, a new taxo¬ 
nomic nomenclature was adapted by the International Committee on 
Taxonomy of Viruses (2009) (http://talk.ictvonline.org/media/g/verte- 
brate-2008/default.aspx). As such, coronaviruses are divided into three 
genera (alpha, beta and gammacoronaviruses), corresponding to groups 
1, 2, 3, within the subfamily coronavirinae, within the family of corona- 
viridae, and within the order or superfamily of nidovirales. 
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Coronaviruses cause diseases in a variety of domestic and wild animals as 
well as in humans. Probably the most well-studied coronavirus is the 
betacoronavirus, murine coronavirus (MuCoV), mouse hepatitis virus 
(commonly referred to as MHV) that has long provided model systems 
for the study of central nervous system (CNS) diseases such as encephali¬ 
tis and multiple sclerosis (MS) and acute hepatitis. While most coronavi¬ 
rus infections cause the common cold in humans, the emergence of the 
agent for severe acute respiratory syndrome (SARS), the SARS-associated 
coronavirus (SARS-CoV), also a betacoronavirus, demonstrated the 
potential for further significant human diseases to result from coronavi¬ 
rus infections. Indeed, shortly after the identification of the SARS-asso- 
ciated human coronavirus (HCoV), new coronavirus were identified in 
association with more severe infections in humans, NL63 an alphacoro- 
navirus, believed to cause bronchiolitis in children, and HKU1, a betacor¬ 
onavirus, associated with chronic respiratory disease in the elderly (Pyre 
et al., 2007). This review will concentrate on the model MuCoV and the 
human SARS-CoV. 


II. GENOME AND VIRION 

Coronaviruses are enveloped positive strand RNA viruses with the larg¬ 
est known RNA genomes, of 30-32 kb (Fig. 1). All coronavirus genomes 
are arranged similarly with the replicase locus encoded within the 5' end 
and the structural proteins encoded in the 3' third of the genome arranged 
in the order hemagglutinin esterase (HE), if present (HE is only present in 
some betacoronaviruses), spike (S), small membrane (E), membrane (M) 
and nucleocapsid (N) and internal (I) protein, encoded within the N gene 
(Fig. 1). The nucleocapsid protein complexes with the genome RNA to 
form a helical capsid structure found within the viral envelope. Trimers of 
the spike protein form the peplomers embedded in the envelope giving 
the virion its corona or crown-like morphology. In some coronavirus 
virions, the HE protein forms smaller spikes on the membrane. M and E 
are also transmembrane proteins involved in virus assembly (Fig. 2). 

The 5' end of the coronavirus genome encodes the replicase gene, 
containing two very large open reading frames (orfs), orfla and orflb, 
encompassing about 20 kb or two-thirds of the genome. The replicase is 
translated as two large polyproteins (pp) la and lab, with pplab 
expressed via a translational frame shift encoded near the end of orfla. 
These replicase polyproteins are cotranslationally cleaved into 16 pro¬ 
teins, many of which have enzymatic activities, including two or three 
proteases, several RNA modification enzymes as well as a polymerase 
and helicase, as will be discussed below. Intermingled with the structural 
genes are a variable number of accessory nonstructural genes encoding 
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FIGURE 1 Genome organization and replicase encoded nonstructural proteins. (A) The 
genomes of MHV-JHM.SD and SARS-CoV are diagrammed. L, leader; ORFla/lb, replicase; 
structural genes: HE, hemagglutinin-esterase; S, spike; E, small membrane envelope; 

M, membrane; N, nucleocapsid; I, internal, orfs encoding accessory genes are designated 
with numbers. (B). Arrows indicate cleavage sites for orfla, orflab encoded polypeptides 
and numbers indicate individual nsp cleavage products. 



FIGURE 2 Coronavirus virion structure. The genome RNA is complexed with the N 
protein to form a helical cased within the viral membrane, HE, hemagglutinin-esterase; 
S, spike; E, small membrane envelope; M, membrane are all transmembrane proteins. 
(Reproduced from Finlay and Hancock, 2004). 


usually small, accessory proteins not essential for replication in cell cul¬ 
ture. These proteins differ in number, sequence, and function among 
coronavirus groups and between MHV and SARS-CoV. It has been 
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widely speculated that these proteins mediate virus host interactions, and 
there are some new data suggesting important functions for some of these 
proteins, as will be discussed below. 


III. CORONAVIRUS-INDUCED DISEASES 

A. MHV pathogenesis 

The MHV is a collection of strains with different organ tropisms. MHV 
strains may be divided into two major biotypes, based on general patterns 
of tropism. One group is enterotropic and includes MHV-D, -Y, -RI, -S/ 
CDC, LIVIM, and DVIM; these viruses are the frequent cause of MHV 
outbreaks in housed rodent colonies (Homberger et ah, 1998). The other 
biotype, the polytropic strains are those generally studied as models of 
human disease. Various strains from this group provide model systems 
for diseases of several organ systems. Neurotropic MHV strains induce 
acute encephalitis and chronic demyelinating diseases, serving as one of 
the few recognized mouse models for MS. Hepatotropic strains provide 
one of the few small animal models for viral hepatitis, and the pneumo¬ 
tropic MHV-1 strain induces severe pneumonitis and reproduces the 
pathology of SARS. Curiously, despite the very different organ tropisms, 
all MHV strains use the same cellular receptor, carcinoembryonic antigen 
molecule (CEACAM)-l, with no known requirements for coreceptors, 
suggesting that MHV tropism is in part determined by postviral entry 
events. 


1. Central nervous system disease 

The most frequently studied MHV strains are the neurotropic ones, pri¬ 
marily JHM and A59. The original JHM isolate, recovered from a paral¬ 
yzed mouse, was highly neurovirulent, inducing encephalomyelitis with 
extensive demyelination (Bailey et ah, 1949; Cheever et ah, 1949). It was 
subsequently passaged multiple times through mouse brains (Lavi et ah, 
1984a; Weiner, 1973; Weiner et ah, 1973). From this mouse brain-adapted 
stock, various clones with very different pathogenic phenotypes were 
isolated and used in many labs, all under the name JHM, causing confu¬ 
sion as to the actual phenotype of JHM. More recently, attempts have been 
made to differentiate among the JHM isolates, as described further below 
(Table I). Among the various JHM isolates, some induce severe encepha¬ 
litis and high mortality and others induce more mild acute disease fol¬ 
lowed by chronic demyelination; the origins and pathogenic phenotypes 
of the various strains has been reviewed recently (Bender and Weiss, 
2010; Weiss and Leibowitz, 2008). The A59 strain is a relatively 


TABLE I Neurotropic MHV strains 


MHV strain 
JHM.SD (MHV-4) 


V5A13.1 (mAb escape 
mutant of JHM.SD) 

OBLV60 (variant of JHM. 
SD isolated from 
persistently infected 
OBL21A cells) 

JHM-DL 

2.2-V-l (mAb escape 
mutant of JHM-DL) 

JHM cl-2 

srr7 (soluble receptor- 
resistant mutant of 
JHM cl-2) 


Pathogenesis 

Highly lethal; severe 
encephalitis 

Neuroattenuated; 
spreads more slowly 
in CNS 

Neuroattenuated 


Highly lethal 

Neuroattenuated; 
subacute 
demyelination 
Highly lethal 

Neuroattenuated 


Tropism 


Spike/spread 


References 


Neurons, glial cells 
Neurons, glial cells 
Olfactory bulb neurons 

Neurons, glial cells 

Glial cells, primarily 
oligodendrocytes 

Neurons, glial cells 

Macrophages / 
microglia (in vitro) 


Gly310; Leulll4; 
CEACAM1- 
independent spread 

HVR deletion (142 aa) 

L1114R; CEACAM1- 
dependent spread 

Leulll4 

L1114F; CEACAM1- 
dependent spread 

Gly310; Leulll4; 
CEACAM1- 
independent spread 

L1114F; CEACAM1- 
dependent spread 


Dalziel et al. (1986) 


Fazakerley et al. (1992) 


Gallagher et al. (1991), 
Pearce et al. (1994) 


Stohlman et al. (1982), 
Wang et al. (1992) 
Fleming et al. (1986), 
Wang et al. (1992) 

Taguchi et al. (1985) 


Matsuyama et al. 
(2001), Nakagaki 
and Taguchi (2005) 




JHM.IA 

Highly lethal, but less 
than JHM.SD 

Neurons, glial cells 

S310; Leulll4; 
CEACAM1- 
dependent spread 

Ontiveros et al. (2003) 

rJHM.IA.S310G (mutant 
of JHM.IA) 

Highly lethal; more than 
JHM.IA 

Neurons, glial cells 

S310G; CEACAM1- 
independent spread 

Ontiveros et al. (2003) 

JHM. WU (MHV Wb3) 

Highly neurovirulent, 
highly hepatotropic 

Not determined 

CEACAM1- 
dependent; HVR 
deletion (140 aa) 

Schwarz et al. (1990), 
Zhao et al. (2011) 

A59 

Neuroattenuated; mild 
encephalitis; subacute 
demyelination; 
hepatitis 

Neurons, glial cells 

HVR deletion (52 aa); 
CEACAM1- 
dependent spread 

Lavi et al. (1984a,b), 
Phillips et al. (2002) 
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neuroattenuated, yet moderately hepatovirulent strain that was isolated 
in 1961 from a mouse with leukemia (Manaker et al., 1961). 

The general paradigm for neurotropic MHV infection can be summar¬ 
ized as follows. Following intracranial or intranasal inoculation, neuro¬ 
tropic MHV infects all of the major CNS cell types including neurons, the 
most frequently infected cell type, and glial cells, astrocytes, oligoden¬ 
drocytes, and microglia. Viral titers typically peak in the CNS at day 5 
postinfection and then begin to decline (Leparc-Goffart et al., 1998), with 
infectious virus becoming undetectable by approximately 2 weeks post¬ 
infection (Matthews et al., 2002). Infected mice develop mild to severe 
encephalomyelitis, characterized by infiltration of a variety of inflamma¬ 
tory cells. Innate immune responses are detectable within the first few 
days postinfection, followed by the development of an adaptive immune 
response (Bergmann et al., 2006; Savarin and Bergmann, 2008). Virus is 
cleared primarily by CD8 + T-cells with help from CD4 + T-cells 
(Williamson et al., 1991). However, despite clearance of infectious virus, 
viral RNA, both genome and mRNA persist in the CNS and demyelin- 
ation, largely immune-mediated, develops, peaking at approximately 1 
month postinfection (Lavi et al., 1984a,b; Marten et al., 2001). 

Among the highly neurovirulent isolates are JHM.SD (San Diego, for¬ 
merly called MHV-4; Dalziel et al., 1986; Ontiveros et al., 2003), JHM.IA 
(Iowa), JHM.WU (Wurzburg, previously called Wb3; Schwarz et al., 1990), 
JHM-DL (Stohlman et al., 1982; Wang et al, 1992), and JHM-cl2 (Taguchi 
et al., 1995). These isolates kill weanling mice with a lethal dose (LD) 50 of 
<10 pfu following intracranial inoculation. There are subtle phenotypic 
differences among these isolates which map to the spike gene as well as to 
other viral genes, as discussed further below. The most neurovirulent 
strains (e.g., JHM.SD, JHM-cl2) are able to spread cell to cell in the absence 
of the only known MHV receptor, CEACAMla (Gallagher and Buchmeier, 
2001). JHM 2.2-V-l (Fleming et al., 1986; Wang et al., 1992), an attenuated 
monoclonal antibody escape variant, is glialtropic and nonlethal in immu¬ 
nocompetent mice; however, JHM 2.2-V-l infection along with A59 infec¬ 
tion provides useful models to demyelination, in that mice do not die of 
acute encephalitis (Bergmann et al., 2001; Lavi et al., 1984a,b). JHM.IA 
infection of suckling mice, passively immunized, provides another model 
that has been used to study MHV-induced demyelination (Pewe et al., 1996). 

2. Hepatitis 

MHV-induced hepatitis has been studied using several strains, including 
highly hepatovirulent MHV-3 and MHV-2 and the more moderately 
hepatotropic A59. The MHV-3 strain, most commonly used to study the 
pathogenesis of MHV-induced hepatitis, was isolated from a VS weanling 
mouse that developed acute hepatitis after inoculation with serum from a 
patient with acute hepatitis (Dick et al., 1956). A liver homogenate from 
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this initial isolate produced no clinical signs when inoculated into naive 
mice. However, as with the neurovirulence of JHM, following serial 
passage of MHV3 in suckling or weanling mice, a virus emerged that 
caused fulminant hepatitis that was lethal for weanling VS mice. This 
virus was primarily hepatotropic, producing massive hepatic necrosis 
and has been called MHV-3 (Dick et al., 1956). 

The extent of liver pathology induced by MHV-3 is dependent on the 
age and the strain of the mouse (Le Prevost et al., 1975). Most strains, 
including DBA/2, BALB/c, and C57BL/6 are highly susceptible to lethal 
disease. However, A/J mice are highly resistant and C3H mice are semi- 
susceptible (Le Prevost et al., 1975). Pathology, characterized by necrotic 
foci and inflammatory infiltrates of neutrophils and mononuclear cells 
(Dick et al., 1956), develops quickly after infection of susceptible mice and 
peaks at 3-4 days postinfection, coinciding with the peak of viral replica¬ 
tion, with death occurring 4-7 days after infection. MHV-3 induced hepa¬ 
titis is characterized by abnormalities in blood flow, including the 
development of micro thrombi in the liver sinusoids (Bloch et al., 1975; 
Levy et al., 1983). Levy et al. (1981) observed that MHV-3 infection of 
peripheral blood mononuclear cells (PBMCs) from susceptible mice 
induces a procoagulant activity (PCA) and that induction of PCA expres¬ 
sion in monocytes in response to MHV-3 infection correlated with sus¬ 
ceptibility to disease. Furthermore, the PCA activity is encoded by the fgl2 
gene, which is induced at the transcriptional level during MHV infection, 
specifically by the MHV-3 nucleocapsid and not by nucleocapsids from 
nonhepatotropic strains (Ning et al., 2003, 1999). MHV-3 infection failed to 
induce the expression of PCA in macrophages from fgl2 null mice in vitro 
and in vivo, supporting an important role for fgl2 encoded PCA in the 
pathogenesis of MHV-3 induced hepatitis. This loss of PCA was reflected 
by an almost complete absence of fibrin deposition in the liver and 
hepatocellular necrosis at 3 days postinfection. Interestingly, it is not 
clear that induction of the fg2 gene is a common feature of hepatitis 
induced by the other MHV strains, such as A59 (data not shown). 

The adaptive immune response to MHV-3 differs between susceptible 
and resistant mouse strains. In susceptible, but not resistant mouse 
strains, MHV-3 infection results in necrosis and destruction of splenic 
and lymphoid follicles (Hirano and Ruebner, 1965; Lamontagne et al., 
1989; Virelizier et al., 1975; Yamada et al., 1979). T and B cells from 
susceptible, but not resistant, mice infected with MHV-3 in vitro were 
permissive to viral replication and underwent cell lysis (Lamontagne 
et al., 1989). In addition, antibody responses to MHV-3 were undetectable 
in BALB/c mice up to death at 5 days postinfection, in contrast to A/J 
mice that began to mount a robust antibody response by that time (Levy 
et al., 1984). Like JHM.SD infection of the CNS, high virulence is 
associated with an inability to induce a robust T-cell response. 
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Using well-characterized moderately hepatotropic A59, nonhepatotro- 
pic JHM, and severely hepatotropic MHV-2 strains, reverse genetics was 
used to map the viral genes that influence the induction of hepatitis, as 
will be discussed in detail below. These studies showed that spike protein 
is a major determinant of hepatovirulence but that one or more back¬ 
ground genes in the 3' end of the genome are also influential. Further¬ 
more, the ns2 protein, an interferon antagonist encoded in the genomes of 
all known MHV strains, is necessary but in the case of JHM not sufficient 
for the induction of hepatitis (Zhao et al., 2011). 

3. Pneumonitis 

The MHV-1 strain is primarily pneumovirulent, different from the previ¬ 
ously discussed strains. MHV-l-induced pneumonitis is highly mouse 
strain dependent; A/J mice, resistant to MHV-3 induced hepatitis, are the 
most susceptible. While Balb /c and C57B1/6 mice are resistant to MHV-l- 
induced pulmonary disease, MHV-1 infection of A/J mice provides a 
mouse model for the pathogenesis of SARS-CoV in humans (De 
Albuquerque et al., 2006). Following intranasal infection of MHV-1, A/J 
mice develop consolidated pneumonitis characterized by hyaline mem¬ 
branes, fibrin deposition and lymphocytic and macrophage infiltration 
and die by 7 days postinfection. Virions are found mostly localized to 
pulmonary macrophages. C3H/HeJ mice exhibit an intermediate pattern 
of resistance / susceptibility, developing chronic pulmonary fibrosis and 
bronchial hyperplasia with 40% of the mice dying by day 28. MHV-1 
replicated in all mouse strains, regardless of susceptibility to disease, 
suggesting that the development of pneumonitis was a result of the host 
immune responses. One of the striking differences between infection of 
susceptible A/J mice and resistant Balb/c and C57B1/6 mice was the less 
robust type I interferon response in A/J mice. In contrast to the type IIFN 
response, A/J mice respond to infection with higher levels of cytokines 
including macrophage chemo-attractant protein 1(MCP-1/CCL2), IFN-y, 
and TNF-a. In addition, in A/J the expression of fgl2 and fibrin deposition 
were markedly increased (De Albuquerque et al., 2006; Leibowitz et al., 
2010). Thus mice susceptible to MHV-3 induced hepatitis bear some 
similarities in cytokine response to A/J mice infected with MHV-1. 

B. SARS-CoV pathogenesis 

SARS is a novel infectious disorder that was first diagnosed in China in 
November 2002 and subsequently spread worldwide (Booth et al., 2003; 
Dwosh et al., 2003; Holmes, 2003; Ksiazek et al., 2003; Lee et al., 2003; Peiris 
et al., 2003b; Poutanen et al., 2003; Tsang et al., 2003; Varia et al., 2003; 
WHO, 2003). SARS was documented in over 8000 persons with 778 deaths 
(WHO, 2003) before the outbreak was extinguished. In 2004, laboratory- 
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associated cases in Singapore, Taiwan, and Beijing were reported, as were 
four nonlaboratory associated cases in Guandong Province, P.R.C. (WHO, 
2004), underlining the possibility of reemergence of SARS. Spread of 
SARS was via airborne droplets and through fomites (Donnelly et ah, 
2003). Electron microscopy, virus isolation, cloning, and sequencing stud¬ 
ies demonstrated that a novel coronavirus was the etiologic agent of SARS 
(Drosten et ah, 2003; Ksiazek et ah, 2003; Marra et ah, 2003; Peiris et ah, 
2003b; Rota et ah, 2003). Shortly thereafter, the coronavirus etiology of 
SARS was confirmed when Koch's postulates were fulfilled using cyno- 
molgus macaques (Macaca fascicularis) (Fouchier et ah, 2003). Although the 
SARS-CoV was initially thought to represent a novel coronavirus sub¬ 
group (Marra et ah, 2003; Rota et ah, 2003), subsequent more extensive 
phylogenetic analyses place it as an early branch of the betacoronaviruses, 
the genus that includes the MHV (Eickmann et ah, 2003; Snijder et ah, 2003; 
Zhu and Chen, 2004). 

Clinically, patients with SARS had a triphasic pattern of disease (Peiris 
et ah, 2003a). Patients most frequently initially presented with fever, a non¬ 
productive cough, sore throat, and myalgia, with dyspnea often not becom¬ 
ing a prominent feature until days 7-14 of the illness. During the second 
phase of the illness, dyspnea and hypoxia, with continued fever and fre¬ 
quently accompanied by diarrhea, became more prominent. Some patient's 
respiratory status continued to deteriorate and they developed acute respi¬ 
ratory distress syndrome often requiring mechanical respiration by the third 
week. Deaths occurred as early as day 4 and as late as 108 days after onset. 
Virus shedding from the respiratory tract generally peaked around day 10 
and subsequently declined. Virus excretion from the GI tract was frequently 
present. IgG antibodies were detected 10-15 days after onset and their 
development was associated with decreased virus load. The severity of the 
disease was correlated with increasing age, with mortality reaching 50% for 
patients over 60 (Booth et ah, 2003; Chan et ah, 2003; Donnelly et ah, 2003; Lee 
et ah, 2003; Peiris et ah, 2003a,c; Tsui et ah, 2003). 

The primary pathology observed at autopsy of patients that suc¬ 
cumbed to infection was diffuse alveolar damage (Ding et ah, 2003; 
Franks et ah, 2003; Hwang et ah, 2005; Nicholls et ah, 2003). The lungs of 
patients that died in the early phases of the disease contained hyaline 
membranes, edema, fibrin exudates, small vessel thrombi, loss and 
sloughing of pneumocytes, and a mixed cellular infiltrate of lymphocytes, 
macrophages, and polymorphonuclear leukocytes. Multinucleated giant 
cells that carried markers for macrophages and pneumocytes were fre¬ 
quently present. At later phases of the disease, a histologic picture of an 
organizing pneumonitis and consolidation, with type II pneumocyte 
hyperplasia, squamous metaplasia, and bronchiolitis obilterans, was 
found. The association of worsening clinical progression with declining 
virus loads and the onset of an immunological response, plus the 
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presence of markedly elevated cytokines levels suggested that severe 
lung damage was largely immunopathological in nature (Beijing Group 
of National Research Project for SARS, 2003; Cameron et ah, 2007; He et ah, 
2006; Nicholls et ah, 2003; Peiris et al., 2003a; Wong et ah, 2004a). 

The zoonotic origin of the SARS outbreak has recently been reviewed 
(Graham and Baric, 2010; Yip et ah, 2009). The earliest cases of SARS in 
Guandong, P.R.C., were disproportionally in workers at wild animal 
markets. Subsequent studies of wild caught animals in these markets 
detected evidence of SARS-CoV infection in Himalayan palm civets 
(Paguma larvata) and raccoon dogs ( Nyctereutes procyonoides) suggesting 
these species as possible sources for human infections (Guan et ah, 2003). 
Attention was focused on civets because of their longer time of virus 
excretion, and epidemiological studies, including finding additional 
cases of SARS among food handlers in restaurants that served civet 
meat (Wang et ah, 2005; Xu et ah, 2004). Sequence comparisons of SARS- 
CoVs isolated from civets and patients supported the transmission from 
civets to humans (Hu et ah, 2003; Song et ah, 2005; Yeh et ah, 2004). Elegant 
work demonstrating that ACE2 is the SARS-CoV receptor, characterizing 
the receptor-binding domain (RBD) including a determination of its 
structure bound to ACE2, and characterization of the key residues 
involved in adaptation of the SARS-CoV RBD to the human ACE2 has 
illuminated the structural changes that evolved to enable efficient human 
infection (Li et ah, 2005a,c, 2003; Wong et ah, 2004b) to be further discussed 
below. However, civets in the wild did not have evidence of current or 
past infection with SARS-CoV making them unlikely as the natural host 
for SARS-CoV (Kan et ah, 2005). The discovery of SARS-like bat corona- 
viruses with approximately 90% sequence identity with SARS-CoV in 
Chinese horseshoe bats (Rhinolophus sinicus) suggests that this or a related 
species of bat is likely origin of SARS-CoV (Lau et ah, 2005; Li et ah, 2005b). 

A number of animal models for SARS were developed during and 
after the SARS outbreak. Three excellent reviews of these models are 
available (Nagata et ah, 2010; Roberts et ah, 2007b; Subbarao and 
Roberts, 2006); thus these models will be only be briefly reviewed here. 
These include nonhuman primate models employing SARS-CoV isolates 
from later phases of the epidemic and cynomolgus macaques (M. fascicu- 
laris ) (Fouchier et ah, 2003; Kuiken et ah, 2003; Rowe et ah, 2004), rhesus 
macaques ( Macaca nmlatta) (McAuliffe et ah, 2004; Qin et ah, 2005; Rowe 
et ah, 2004) African Green monkeys ( Cercopithecus atheiops or Chlorocebus 
sabeus) (McAuliffe et ah, 2004), and in marmosets ( Callithrix jacchns) 
(Greenough et ah, 2005). Although all of these animals support the repli¬ 
cation of SARS-CoV in their respiratory tracts, most develop relatively 
mild disease. In addition, the degree of the severity of the disease and 
pathology observed in cynomolgus macaques by different workers was 
variable. SARS-CoV is able to infect cats ( Felis domesticus) and ferrets 
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(Mustela furo) (Martina et al., 2003) and although infection does not pro¬ 
duce severe disease in either species, the ferret model has been utilized in 
protection studies and to study the host response to infection (Chu et al., 
2008; Czub et al., 2005; Danesh et al., 2011; ter Meulen et al., 2004; 
Weingartl et al., 2004). 

Multiple rodent models of SARS have been developed. Young BALB/c 
mice can be infected by SARS-CoV but develop minimal pathologic 
changes and no disease but virus does replicate for a short period of time 
in the respiratory tract, reaching substantial titers in the lung (Subbarao 
et al., 2004). Similar to the case with human SARS-CoV infections, aged 
mice develop more severe disease than young mice with greater viral 
replication in the lungs, evidence of clinical illness, and the histologic 
changes of interstitial pneumonitis with alveolar damage, similar to that 
observed in human SARS (Roberts et al., 2005a). In contrast to 4-6-week- 
old mice, BALB / c mice 12-14 months of age had elevated levels of IFN-a, 
IFN-y, and TNF-a early in infection, suggesting that high levels of proin- 
flammatory cytokines contribute to the more severe disease observed with 
increased age. Infection of Syrian Golden hamsters with SARS-CoV results 
in an acute interstitial pneumonitis and lymphocytic inflammatory lesions 
in the liver (not typically seen in human SARS) without clinical symptoms, 
and virus was completely cleared by day 14 (Roberts et al., 2005b). Several 
mouse-adapted strains of SARS-CoV have been developed (Day et al., 
2009; Nagata et al., 2008; Roberts et al., 2007a) with the MA15 strain 
developed by the Baric lab being the most extensively studied. These 
viruses produce severe lethal disease resembling SARS in young (Day 
et al., 2009; Roberts et al., 2007a) or aged mice (Nagata et al., 2008; Roberts 
et al., 2007a). Two of these viruses have been sequenced and both carry 
identical Y436H mutations in the RBD of the S protein as well as other 
mutations in replicase (nsp3, nsp 5, nsp9, nspl3), structural (M and addi¬ 
tional S mutations), and accessory (3b) proteins (Day et al., 2009; Roberts 
et al., 2007a). In addition, a rat-adapted strain of SARS-CoV has also been 
developed that replicates more efficiently in rats than the parental Frank- 
fortl strain from which it was derived and produces clinical disease and 
pneumonitis with diffuse alveolar damage in 6-month-old rats (Nagata 
et al., 2007). This virus contains a mutation in the RBD of the S protein, 
allowing it to bind more efficiently to rat ACE2, the SARS-CoV receptor 
(Nagata et al., 2007). Thus adaptation of the S protein to rodent ACE2 
appears to be a significant element of the evolution to strains pathogenic 
for rodents. Transgenic mouse models in which human ACE2 was 
expressed in the respiratory tract as well as other tissues have been devel¬ 
oped (McCray et al., 2007; Tseng et al., 2007). In both of these models, the 
transgenic mice rapidly succumb after intranasal inoculation of SARS- 
CoV due to infection of the CNS, limiting their usefulness in the pathogen¬ 
esis of SARS (Netland et al., 2008; Tseng et al., 2007). 
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Genetic knockout mice have been extensively employed to identify 
important elements of host immunity that contribute to the pathogenesis 
of SARS. Infection of beige, CD1 / “, and RAGl~ /_ mice resulted in 
minimal pulmonary pathology, and virus grew to similar titers in the 
lungs and was cleared with similar kinetics to that observed in wild-type 
C57B1/6 mice, demonstrating that NK cells and adaptive cellular immu¬ 
nity are not required for viral clearance (Glass et al., 2004). However, 
recent experiments using adoptive transfer have demonstrated that 
virus-specific T-cells derived from immunized mice ameliorate the devel¬ 
opment of disease and pulmonary pathology and decrease mortality in 
mice challenged with mouse-adapted MA15 (Zhao and Perlman, 2010) 
Infection of type I, type II, or type III interferon receptor knockout mice on 
a strain 129 background, with SARS-CoV (Urbani strain) or the mouse- 
adapted MA15 virus, resulted in clinical disease and pathologic changes 
identical to that observed in wild-type strain 129 mice (Frieman et al., 
2010). This contrasted with the results obtained with a STATI knockout 
where genetic ablation of STATI increased the severity of disease with 
both the MA15 and the Urbani strains of SARS-CoV. This suggests that 
STATI may contribute to SARS-CoV pathogenesis by an interferon inde¬ 
pendent mechanism and it has been speculated that this is related to its 
role in regulating cell proliferation (Frieman et al., 2010). Microarray 
analysis of lungs harvested from IFNRAl~ /_ mice demonstrated strong 
expression of interferon-stimulated genes in spite of the lack of type I 
interferon receptors (Zornetzer et al., 2010). In contrast, STATl _/_ mice 
exhibited a defect in the expression of interferon-stimulated genes and 
were unable to clear the infection, resulting in a lethal outcome (Frieman 
et al., 2010). Microarray data suggested dysregulation of T-cell and mac¬ 
rophage differentiation, with a TH2-biased immune response and a pro- 
fibrotic environment within the lung (Zornetzer et al., 2010). Infection of 
mice in which MyD88 was genetically ablated with MA15 resulted in 
increased mortality and pulmonary pathology with higher viral loads in 
lung, compared to MA15 infection of wild-type mice (Sheahan et al., 2008). 
In spite of the high viral loads the transcription of proinflammatory 
cytokine and chemokine genes in lung, and recruitment of macrophages 
to the lung were severely impaired. Mice in which the CCR1, CCR2, or 
CCR5 chemokine receptors had been genetically ablated also had more 
severe disease (Sheahan et al., 2008), suggesting a role for macrophage 
recruitment in controlling the disease. 

Other host factors have also been implicated in the pathogenesis of 
SARS, primarily from work on murine models. Multiple SARS-CoV pro¬ 
teins have been reported to interact with components of the innate 
immune system to evade an antiviral interferon response, and these are 
discussed below with the individual proteins that have been implicated in 
this process. The expression of ACE2, the SARS-CoV receptor, on the 
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surface of cells is downregulated after infection with SARS-CoV (Kuba 
et al., 2005). The mechanism of this downregulation appears to be due to 
internalization of ACE2 during SARS-CoV entry (Wang et al., 2008) and 
by induction of tumor necrosis factor alpha converting enzyme activity or 
Adams family metalloproteases which cleave the ACE2 extracellular 
domain from its transmembrane domain, resulting in shedding of this 
domain into the media (Haga et al., 2008). ACE2 has a pneumoprotective 
effect on acute lung injury induced by acid injury (Imai et al., 2005), and 
instillation of a recombinant fusion protein containing the SARS S protein 
RBD increased acute lung injury by acid (Kuba et al., 2005). These results 
have led to the hypothesis that the binding of SARS-CoV S protein is a 
virulence factor for SARS above and beyond its role in viral attachment 
and entry. Furthermore, in a mouse model, SARS-CoV replication in 
myocardium during pulmonary infection correlated with downregula¬ 
tion of ACE2 in the heart (Oudit et al., 2009). This data combined with the 
detection of inflammatory lesions and viral replication in myocardial 
tissue of patients that died of SARS suggests that downregulation of 
ACE2 and cardiac infection could contribute to SARS mortality (Oudit 
et al., 2009). As described in more detail in Section V.A.2 below, several 
different proteases, including cathepsin L (Simmons et al., 2005) and the 
serine protease TMPRSS2 (Matsuyama et al., 2010; Shulla et al., 2011) have 
been reported to affect SARS-CoV entry through cleavage of the spike 
protein and activation of its membrane fusion activity. A large number of 
noncoding RNAs have also been demonstrated to be differentially regu¬ 
lated during infection of mice with MA15 (Peng et al., 2010). About 40% of 
these noncoding RNAs are similarly regulated during in vitro infection of 
mouse embryonic fibroblasts with mouse-adapted influenza virus and by 
interferon treatment, suggesting that these noncoding RNAs may play a 
role in regulating the host response to virus infection, particularly the 
innate immune response. 


IV. CORONAVIRUS REVERSE GENETICS 

The development of coronavirus reverse genetic systems has greatly 
enhanced our understanding of coronavirus replication and pathogene¬ 
sis. This is particularly true in regard to illuminating the functions of viral 
proteins that interact with host proteins that are part of the host response 
to infection. We briefly review the various reverse genetic systems that are 
available for the different coronaviruses. For more detailed information 
about the various approaches that have been employed, the reader can 
consult several excellent reviews (Baric and Sims, 2005; Enjuanes et al., 
2005; Masters, 2006; Masters and Rottier, 2005; Thiel and Siddell, 2005) 
and the primary literature on each reverse genetic system. 
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The large size of the coronavirus genome, 27-32 kb, presented serious 
obstacles to developing reverse genetic systems similar to those used for 
smaller positive sense RNA viruses, where a cDNA clone of the genome is 
transcribed in vitro and the RNA product is transfected into permissive 
cells to regenerate infectious virus. These obstacles were due to both the 
large size of cDNAs corresponding to complete coronavirus genomes, 
and to the instability of various portions of coronavirus replicase genes 
when cloned into conventional E. coli plasmid vectors (Almazan et al., 
2000; Yount et al., 2000). This delayed the development of reverse genetic 
systems for these viruses for a number of years following the completion 
of the first coronavirus sequence (Boursnell et al., 1987) and resulted in the 
development of alternatives to more conventional plasmid-based 
approaches. 

A. Targeted recombination 

The first coronavirus reverse genetic system that was developed was 
targeted recombination for MHV, strain A59 (MHV-A59), and took 
advantage of the phenotype of a particular temperature-sensitive mutant 
virus, Alb4, that contained a small in-frame deletion of the N gene that 
rendered it both much more sensitive to thermal inactivation at 40 °C than 
wild-type MHV-A59, and conferred a temperature-sensitive phenotype 
in that it produced a lesser number of very small plaques at 39 °C that 
were easily distinguished from those formed by wild-type virus under 
identical conditions (Masters et al., 1994). In this system, a synthetic 
defective interfering (DI) RNA consisting of the first 467 nucleotides of 
the MHV genome fused to 48 nucleotides derived from the VSV N gene, 
fused in turn to the MHV N coding sequence followed by the MHV 3'UTR 
and a poly (A) tail functioned as an RNA replicon that could recombine 
with Alb 4, thus introducing mutations engineered into the N gene or 
3'UTR of the resulting recombinant viruses. This system allowed the 
efficient recovery of viruses that contained mutations introduced into 
the N gene or the 3' UTR, but was less efficient in introducing mutations 
into genes 5' of the N gene. 

Subsequently, the targeted recombination methodology was signifi¬ 
cantly advanced by taking advantage of the fact that the host range of 
coronaviruses is largely controlled at the entry step, thus providing a 
powerful means of selecting recombinant viruses (Kuo et al., 2000). In 
this system a cDNA clone representing a synthetic DI RNA has been 
enlarged to now contain all of the MHV sequences from codon 28 of the 
HE gene to the 3'UTR (pMH54, see Fig. 3). To create an appropriate 
acceptor virus, Kuo et al. (2000) created a donor DI RNA in which the 
sequences encoding the ectodomain of the S protein were replaced by the 
corresponding sequences of feline infectious peritonitis virus (FIPV). 
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FIGURE 3 Targeted recombination. A schematic representation of targeted recombi¬ 
nation. A59 sequences are shown in blue, FIPV S ectodomain sequences are shown in 
yellow, and JHM sequences are shown in red. Based on Kuo et al. (2000). 


After transfection with this donor DI RNA into cells infected with MHV- 
A59, recombinant viruses (fMHV) in which the MHV S ectodomain coding 
sequences replaced by their FIPV counterparts were selected by their ability 
to grow in feline cells but not murine cells. FMHV can then be used as the 
acceptor virus in the reverse process, where feline cells are infected with 
fMHV and transfected with pMH54 or mutation-containing derivatives, and 
recombinants viruses containing the introduced mutations selected on 
murine cells. The powerful host range selection enables manipulation of 
the sequences extending from the S gene through the 3'UTR, a region that 
includes all of the essential MHV structural genes and is sufficiently efficient 
to allow the isolation of mutants with very crippled phenotypes (Kuo and 
Masters, 2003). Although the targeted recombination system was created for 
MHV-A59, it was extended to the JHM strain of MHV (MHV-JHM) 
(Ontiveros et al., 2001) and to MHV-1 (B.M. McGruder and J.L. Leibowitz, 
unpublished). In principle, a system similar to this can be created for any 
coronavirus that replicates in cultured cells. 


B. Reverse genetic systems that regenerate virus from cDNA 

A significant limitation of the targeted recombination system is the inabil¬ 
ity to easily genetically manipulate genes upstream of the S gene, and for 
introducing mutations or extensive chimeric sequences downstream of 
the S gene the need to screen recombinant viruses by sequencing to avoid 
selecting viruses in which a double crossover has occurred to produce a 
virus that lacks the desired genotype. To overcome this limitation, reverse 
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genetic systems that regenerate virus from cDNA copies of the genome 
have been developed. As alluded to above, the large size of the coronavi- 
rus genome and the instability of portions of the coronavirus replicase 
gene posed significant obstacles to the development of reverse genetic 
systems for coronaviruses (Almazan et al., 2000; Baric and Yount, 2000). 

1. Transmissible gastroenteritis virus 

The first full-length infectious cDNA clone of a coronavirus was created 
for transmissible gastroenteritis virus (TGEV) by Almazan et al. (2000). 
Initial attempts at assembling cDNAs representing the entire TGEV 
genome by stepwise ligation in a plasmid vector failed due to instability 
that could be remedied by omitting one 5.2 kbp fragment derived from 
the replicase. This difficulty was circumvented by transferring the cDNA 
lacking the 5.2 kbp replicase fragment to a bacterial artificial chromosome 
(BAC) and then ligating the missing replicase fragment into the BAC to 
reconstruct the complete TGEV genome. The cDNA was under the con¬ 
trol of a cytomegalovirus (CMV) promoter, and the hepatitis delta virus 
ribozyme is down stream of a poly (A) tail to ensure that the transcript 
contains the correct 3' end. Virus is regenerated by transfection into a 
TGEV-permissive cell, where after translocation of the BAC to the 
nucleus, transcription of the cDNA regenerates the TGEV genome, 
which is subsequently exported to the cytoplasm. Once the genome 
reaches the cytoplasm, viral replication resumes normally. Although 
there are several potential splice sites in the TGEV genome, splicing 
appeared to occur at only low levels, allowing efficient recovery of 
virus. Although the BAC TGEV clone was stable in E. coli, additional 
stabilization was subsequently obtained by inserting an intron into the 
regions of the ORF1 gene that are associated with cDNA instability in 
bacteria (Gonzalez et al., 2002). 

Almost simultaneously with the development of the BAC-based 
reverse genetic system for TGEV, a second reverse genetic system for 
TGEV was created using an in vitro cDNA assembly approach (Yount 
et al., 2000). This approach molecularly cloned the TGEV genome as six 
cDNAs, which together spanned the entire TGEV genome. These cDNAs 
were created by RT-PCR such that the cDNA which represented the 5' end 
of the genome was immediately downstream of a T7 promoter, and the 
cDNA ends contained restriction sites for a subset of restriction enzymes 
(such as Bgll, recognition site GCCNNNN [ NGGC) that leave sticky ends 
that are arbitrary in sequence and which will occur very infrequently, 
making it possible to assemble these cDNAs by in vitro ligation. As was 
the case with the BAC-based system described above, plasmid instability 
made it necessary to divide one replicase cDNA fragment into two sepa¬ 
rate clones in order to stably maintain the cDNAs in E. coli. The separately 
cloned cDNAs were then excised from plasmids by restriction enzyme 
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digestion and ligated together to assemble a cDNA that corresponds to 
the complete TGEV genome. The assembled TGEV cDNA was used as 
template for in vitro transcription to generate capped high molecular 
weight RNAs. To enhance the recovery of infectious virus, transcripts of 
the TGEV N gene were separately synthesized in vitro and mixed with the 
full-length cDNA transcripts, and the mixture electroporated into BHK 
cells. Although BHK cells do not express the porcine amino peptidase N 
receptor for TGEV and thus cannot support secondary rounds of infec¬ 
tion, they are fully permissive for transfected RNA and were used 
because of their high electroporation efficiency. The electroporated cells 
were then seeded with the fully TGEV-permissive and infectable swine 
testicle (ST) cell line to recover infectious TGEV. 

Each of these two approaches to generating a reverse genetic system has 
its advantages and disadvantages. The BAC approach using a CMV pro¬ 
moter is simpler in that the viral cDNA is propagated as a single clone in 
E. coli, and because regenerating virus relies on host cell transcription rather 
than in vitro transcription to regenerate virus genomes, it is considerably less 
expensive than other approaches utilizing in vitro transcription withT7 RNA 
polymerase. However, a disadvantage of the BAC system is that genetic 
manipulation of the cDNA clone in the BAC is not as facile compared to the 
in vitro cDNA assembly system where the separation of the cDNA into 
multiple smaller fragments facilitates the introduction of mutations. 


2. Human coronavirus 229E 

Thiel et al. (2001a,b) used a vaccinia virus-based approach to develop a 
reverse genetic system for human coronavirus 229E (HCoV-229E). This 
was necessary due to the instability of a region within orfla (approxi¬ 
mately nts 5200-7000) when cloned into plasmid vectors in E. coli. A series 
of cloned cDNAs were ligated with an RT-PCR amplicon containing the 
unstable region to assemble a cDNA, which together represented the 
entire replicase region of HCoV-229E. To obviate the problem of plasmid 
instability, the assembled replicase cDNAs were ligated to Notl-cleaved 
vNotl/tk vaccinia virus DNA (Merchlinsky and Moss, 1992), and a 
recombinant vaccinia virus containing this HCoV-229E cDNA under the 
control of a T7 promoter was then recovered by transfecting the ligation 
products into fowlpoxvirus (FPV)-infected cells (Thiel et al., 2001b). DNA 
from the recovered recombinant vaccinia virus was subsequently 
extracted and ligated in vitro to a second cDNA clone representing the 
remaining 3' portion of the genome. The resulting cDNA which corre¬ 
sponded to the complete HCoV-229E genome was then ligated to Notl- 
cleaved vNotl/tk vaccinia virus DNA (Merchlinsky and Moss, 1992), and 
a second recombinant vaccinia virus was then recovered by transfecting 
the ligation products into fowlpoxvirus-infected cells. The vaccinia virus 
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DNA was then extracted and digested with Clal (Clal sites are absent in 
the HCoV-229E cDNA, but one is present just 3' to the HCoV-229E cDNA 
insert), and capped RNA corresponding to the HCoV-229E genome was 
synthesized by in vitro transcription with T7 RNA polymerase in the 
presence of cap analog. Transfection with this RNA regenerated infec¬ 
tious HCoV-229E. Mutations can be introduced into the HCoV-229E 
cDNA by vaccine virus-mediated homologous recombination with trans¬ 
fected plasmid DNA carrying the desired mutation and the E. coli gpt gene 
by sequential selection for and against ypf-containing recombinant vac¬ 
cinia viruses (Falkner and Moss, 1988; Thiel et al., 2001a) or by trans¬ 
dominant selection (Falkner and Moss, 1990). 

3. Mouse hepatitis virus 

Two different approaches have been employed to develop reverse genetic 
systems for MHV. Both systems allow relatively facile genetic manipula¬ 
tion of MHV-A59. The Baric lab utilized a cDNA fragment assembly 
approach similar to the one they utilized for TGEV to develop a plas¬ 
mid-based system in which the MHV-A59 genome is cloned in seven 
fragments (Yount et al., 2002). One improvement made in this system 
was the incorporation of type IIS restriction enzyme recognition sites 
(such as Esp3I) at the ends of the amplified and cloned fragments. Type 
IIS restriction enzymes recognize a strand-specific sequence rather than a 
palindromic sequence and cleave double-stranded DNA such that the 
restriction enzyme recognition site is cleaved from the DNA, but leave 
an overhang that can be an arbitrary sequence (lower case sequences in 
Fig. 4), permitting the ligation of any two cDNAs in which compatible 
overhangs have been engineered. This allowed the joining of the seven 
MHV-A59 cDNAs at points of the investigators' choosing rather than 
having to either rely on the existence of rare unique restriction sites or 
to have to engineer coding silent mutations to create unique sites. The 
assembled MHV cDNA was used as template for in vitro transcription to 
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FIGURE 4 Assembly strategy of full-length coronavirus genomes. The use of the type II 
S restriction enzyme Esp3l to ligate two cDNAs of arbitrary sequence. The Esp3l recog¬ 
nition site is shown in underlined upper case text. The arbitrary sequence at which the 
two cDNAs are joined is shown in lower case text. Based on Yount et al. (2002). 






Coronavirus Pathogenesis 105 


generate capped high molecular weight RNAs. As described above, the 
recovery of infectious virus was enhanced by adding in vitro synthesized 
transcripts of the MHV N gene to the full-length cDNA transcripts, and 
the mixture electroporated into BHK-R cells (Dveksler et al., 1991), a cell 
line that has been transformed with Ceacamla, the receptor for MHV. The 
electroporated cells are then overlaid onto fully permissive DBT cells to 
increase the yield of virus. The initial recombinant virus generated by this 
approach, MHV-1000, did not produce disease in mice after intracranial 
inoculation (Sperry et al., 2005). This attenuated phenotype was mapped 
to two mutations, one in nspl4, and the second in the ns2 gene. Neither 
mutation was in the predicted active sites of the two proteins. Correction 
of these mutations allows regeneration of recombinant MHV-A59 that is 
as virulent as the nonrecombinant virus. The Weiss lab has created a 
similar reverse genetic system for the widely studied JHM.SD strain of 
MHV (T.J. Cowley et al., unpublished). 

The second reverse genetic system for MHV-A59 employed vaccinia 
virus as a eukaryotic cloning vector (Coley et al., 2005). Four cDNAs 
representing the entire MHV-A59 genome were assembled from smaller 
cDNA clones that were stable in E. coli and from RT-PCR products with 
the 5' end of the genome immediately downstream of a T7 promoter. 
These four cDNAs were then ligated in vitro, and the resulting ligation 
product which contained Notl compatible ends and corresponded to full- 
length MHV-A59 cDNA was subsequently ligated to Notl-cleaved 
vNotl/tk vaccinia virus DNA (Merchlinsky and Moss, 1992). Recombi¬ 
nant vaccinia virus containing the MHV cDNA was recovered by trans¬ 
fecting the ligation products into fowlpoxvirus-infected cells, and the 
recombinant virus subsequently plaque cloned as described previously 
(Thiel et al., 2001a). Sequence errors inadvertently introduced into the 
MHV cDNA during the RT-PCR and cloning steps were corrected step¬ 
wise by four cycles of vaccinia virus-mediated homologous recombina¬ 
tion with transfected plasmids carrying the corrected sequence and the 
E. coli gpt gene with selection for and against gpt as described previously 
(Falkner and Moss, 1988; Thiel et al., 2001a). Large amounts of DNA can 
then be prepared from purified recombinant vaccinia virus, and the 
purified DNA containing the MHV-A59 cDNA digested with EagI to 
provide a template for T7 transcription. The MHV cDNA was transcribed 
in vitro to generate capped high molecular weight RNAs. Recovery of 
infectious virus was enhanced by adding in vitro synthesized transcripts 
of the MHV N gene prior to electroporation into BHK-21 cells, which are 
subsequently incubated with the MHV permissive murine fibroblast cell 
line, 17C1-1, to regenerate recombinant MHV-A59. This virus was virulent 
in mice. Mutant strains of recombinant MHV can be generated by 
employing vaccinia-mediated recombination as described above for cor¬ 
recting RT-PCR mutations. 
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4. Avian infectious bronchitis virus 

Two reverse genetic systems have been developed for avian infectious 
bronchitis virus (IBV) (Casais et al., 2001; Youn et al., 2005). Casais et al. 
(2001) utilized a vaccinia-based approach similar to those described 
above to ligate three cloned cDNAs into a single large cDNA represent¬ 
ing the entire IBV genome under the control of a T7 promoter. Recom¬ 
binant vaccinia virus containing the IBV genome was recovered as 
described above for HCoV-229E. Rather than using in vitro transcrip¬ 
tion to regenerate recombinant IBV, permissive chicken kidney cells 
were infected with rFPV-T7 to provide cytoplasmic T7 RNA polymer¬ 
ase and the poxvirus guanylyltransferase to cap any T7 transcripts, and 
at 1 h p.i., the cells were transfected with Sail- or Ascl-digested recom¬ 
binant vaccinia virus DNA containing the IBV full-length cDNA to 
generate IBV genomes in vivo, which subsequently replicated to gener¬ 
ate recombinant IBV. Youn et al. (2005) utilized the in vitro cDNA 
fragment assembly strategy followed by in vitro transcription to regen¬ 
erate infectious recombinant IBV from seven discrete cloned cDNAs. 
Both reverse genetic systems have been used to study various facets of 
IBV pathogenesis and replication, including the development of avian 
vaccines. 


5. SARS-coronavirus and related Bat-SARS-like coronavirus 

Two reverse genetic systems have been developed to study SARS-CoV, 
one based on the in vitro cDNA assembly approach (Yount et al., 2003); in 
the second reverse genetic system, a cDNA corresponding to the complete 
SARS-CoV genome was cloned into a BAC under the control of a CMV 
promoter and followed by the hepatitis delta virus ribozyme to create a 
correct 3' end during transcription from the transfected BAC (Almazan 
et al., 2006). These systems have been widely utilized to investigate the 
replication and pathogenesis of SARS. Using synthetic biology and the 
cDNA assembly approach, Becker et al. (2008) created a series of cloned 
cDNAs corresponding to the consensus sequence of several bat SARS-like 
CoVs (bat-SCoV) and subsequently attempted to recover this previously 
uncultivated virus. This effort failed although evidence for viral replica¬ 
tion in the electroporated cells was detected by RT-PCR, most likely due 
to not having a fully permissive bat cell line containing the cognate 
receptor (likely bat ACE2) for this virus. When the putative bat-SCoV 
RBD of the S protein was replaced by the homologous SARS-CoV RBD, 
the authors were able to recover and characterize infectious virus. This 
work illustrates the power of the combination of synthetic biology and 
coronavirus reverse genetics to generate coronavirus species that are only 
known from sequence information, but have not been successfully grown 
in cell culture. 
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6. Human coronaviruses OC43 and NL63 

A reverse genetic system for a mouse neurovirulent strain of the human 
coronavirus OC43 (HCoV-OC43) was created using a BAC system similar 
to that used for TGEV and SARS-CoV (St-Jean et al., 2006). Like the TGEV 
and SARS-CoV systems described above, the HCoV-OC43 reverse genetic 
system relies on transcription from a CMV promoter to transcribe HCoV- 
OC43 genome RNAs from the transfected BAC containing a cDNA clone 
of the complete HCoV-OC43 genome. The recovered virus was neuro¬ 
virulent for mice after intracranial injection, as was the parental virus. 

A reverse genetic system for HCoV-NL63 was created using a cDNA 
assembly approach in which five cDNAs representing the complete HCoV- 
NL63 genome were ligated in vitro and transcribed from a T7 promoter to 
regenerate viral genomes to recover recombinant virus by electroporation 
(Donaldson et al., 2008). This system was applied to demonstrate that 
HCoV-NL63 ORF3 was not essential for virus replication in cell culture 
and could be replaced by GFP to create a virus containing this marker. 

7. Feline coronavirus 

A reverse genetic system for type I feline coronavirus (FCoV) strain Black 
has been developed using a vaccinia-based approach (Tekes et al., 2008) 
similar to those described above for HCoV-229E, IBV, and MHV. This 
system was utilized to create two recombinant FCoVs in which the non- 
essential 3abc genes in the FCoV genome were replaced by GFP or Renilla 
luciferase genes to create recombinant viruses that are suitable for both 
in vivo and cell culture studies of FCoV. 


V. STRUCTURAL PROTEINS 

We review the coronavirus structural proteins, which have important 
functions in pathogenesis as well as virion assembly and structure. 
These include the membrane-spanning proteins found in all coronavirus 
virions, spike, membrane, small membrane, and the HE, expressed by a 
subset of coronaviruses. We then discuss the nucleocapsid protein com¬ 
plexes with virion RNA to form a helical encased structure and the I 
protein of unknown function. 

A. Spike protein (S) 

1. MHV spike protein 

The spike protein is a type I membrane protein that is inserted in the viral 
envelope to form the peplomers that both give the virions their character¬ 
istic crown-like morphology (Fig. 2) and interact with viral receptors to 


108 


Susan R. Weiss and Julian L. Leibowitz 


mediate viral entry as well as cell to cell spread, through their ability to 
induce membrane fusion. Spike is synthesized as an approximately 
120 kDa precursor that is cotranslationally glycosylated to obtain its 
final 180 kDa molecular weight. The S proteins of most MHV strains 
(with the notable exception of MHV-2) are cleaved by a cellular furan- 
like protease into two noncovalently associated approximately 90 kDa 
subunits, the N-terminal SI and C-terminal S2 (Frana et al., 1985; Sturman 
et al., 1985) (Fig. 5). Spike is assembled on the membrane as a trimer in 
which the SI subunits form a globular head structure and the S2 subunits 
form a transmembrane stalk. During infection, S attaches to the MF1V 
receptor CEACAMla and mediates viral entry, usually directly at the 
plasma membrane (Gallagher et al., 1991; Qiu et al., 2006), but MHV may 
also employ an endosomal route of entry (Eifart et al., 2007) and this may 
be cell type dependent. The spike protein of MHV-2 like that of SARS- 
CoV (see below) is not cleaved during synthesis; MHV-2 entry occurs via 
an endosomal route and requires cleavage by cathepsin in a low pH 
environment (Qiu et al., 2006). 

Recently, the crystal structure of the MHV spike N-terminal domain 
(NTD), the RBD, complexed with CEACAMla has been solved (Peng 
et al., 2011). Interestingly, the core structure of the NTD contains the 
same p-sandwich fold as in human galectins, suggesting binding activity 
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to carbohydrates, as well an additional structural motif that binds to the 
N-terminal Ig-like CEACAM1 domain. Interestingly, while the MHV 
spike does not bind to sugars and uses only the protein receptor CEA- 
CAMla, other coronavirus spikes, for example, those of BoCV, HCoV- 
OC43, TGEV (Krempl et al., 2000), and IBV (Niesters et al, 1987), bind to 
cells through a lectin-like activity. Peng et al. (2011) speculated that coro¬ 
navirus NTDs were derived originally from human galectins, which 
evolved over time to lose carbohydrate-binding ability and to gain the 
ability to interact with CEACAMla, while other coronaviruses spikes 
remained dependent on carbohydrate binding for cell attachment. 

The coronavirus spike has a major influence on viral tropism and 
pathogenic phenotype. In the case of MHV, selection by reverse genetic 
techniques and characterization of chimeric and mutant viruses has been 
extremely useful for the mapping of pathogenic properties. Exchange of 
the spike genes between the highly neurovirulent JHM.SD strain and the 
weakly neurovirulent, yet hepatotropic, A59 strain demonstrated that the 
spike is a major (but not sole) determinant of the high neurovirulence of 
JHM.SD (Navas and Weiss, 2003; Navas et al., 2001; Phillips et al., 2002). 
However, the role of spike in determining liver tropism is more compli¬ 
cated. When A59 background viruses differing only in spike were com¬ 
pared, the level of virulence was determined by the spike protein 
expressed (Navas et al., 2001). Thus, a chimeric virus expressing the 
spike of the highly hepatotropic MHV-2 strain from within the A59 
background genes (rA59/S M HV-2) wa s significantly more hepatotropic 
than A59, while a chimeric virus expressing the spike from the nonhepa- 
totropic JHM.SD within the A59 background (rA59/SJHM.SD) was less 
hepatotropic than A59. However, perhaps surprisingly, rA59/Sj H M.SD 
was able to replicate to a significant extent in the liver, albeit only at 
high dose; this is in contrast to JHM.SD that replicates at or near the 
limit of detection even when inoculated at very high doses. A recombi¬ 
nant virus expressing the hepatotropic A59 spike within the JHM back¬ 
ground (rJHM/SA 59 ) failed to infect the liver even at high doses of virus, 
demonstrating that JHM genetic background eliminates A59 spike-deter¬ 
mined hepatotropism. These data imply that liver tropism is at least in 
part determined by postentry events (Navas and Weiss, 2003). 

Comparison of the spike proteins of the many JHM isolates has been 
informative in elucidating the viral determinants of high neurovirulence. 
As a general rule, the most neurovirulent JHM isolates (e.g., JHM.SD, 
JHM-cl2) are able to spread cell-to-cell in a CEACAMla-independent 
manner, a process referred to as "receptor independent spread" or 
"RIS." There are several functional domains in the MHV spike that 
have been demonstrated to effect the ability to carry out RIS and to 
contribute to the corresponding highly neuropathogenic phenotype. 
These are: (1) the N-terminal RBD, originally defined as the first 330 
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amino acid of spike (Kubo et al., 1994), which encompasses the NTD 
described above (Peng et al., 2011), (2) the hypervariable domain (HVR) 
within SI, and (3) the two the heptad repeat domains (HR1 and HR2) 
within S2 (Fig. 5). The RBD binds to the NTD of the MHV receptor, 
CEACAM1, a member of the IgG superfamily. Single amino acid substi¬ 
tutions within the RBD have major effects on neurovirulence and organ 
tropism. The enhanced neurovirulence of JHM.SD over JHM.IA was 
mapped to a S310G substitution within the JHM.SD spike protein, confer¬ 
ring the ability to carry out RIS (Ontiveros et al., 2003). Characterization of 
chimeric JHM.SD/A59 recombinant viruses with exchanges of the RBDs 
demonstrated that CEACAMla-independent spread and the very high 
neurovirulence that accompanies RIS require both the RBD and the rest of 
the spike to be derived from JHM.SD (Tsai et al., 2003b). In addition to 
modulating neurovirulence, the RBD clearly also plays a role in hepato- 
tropism as a single Q159L amino acid substitution eliminates the ability of 
A59 to infect the liver and induce hepatitis while having no effect on 
neurovirulence (Leparc-Goffart et al., 1997,1998). Interestingly, the crystal 
structure of MHV spike NTD in complex with CE AC AMI a would predict 
that while Q159 does not directly interact with receptor, it would influ¬ 
ence the binding of R20 to CEACAMla receptor (Peng et al., 2011). A 
recent study demonstrated that the MHV-1 spike gene expressed within 
the A59 genome conferred pneumovirulence; however, other genes both 
within the 3' and 5' portions of the genome were required for full pneu¬ 
movirulence of MHV-1 (Leibowitz et al., 2010). 

The highly neurovirulent JHM.SD (Dalziel et al., 1986; Ontiveros et al., 
2003), JHM cl-2 (Taguchi et al, 1985), and JHM-DL (Wang et al, 1992), all 
capable of carrying out RIS, express spikes with relatively long HVRs. The 
neuroattenuated phenotypes of a group of monoclonal antibody escape 
variants of JHM.SD, for example, V5A13.1 (Fazakerley et al, 1992), are 
associated with single site mutations or deletions within the HVR (Dalziel 
et al, 1986; Gallagher and Buchmeier, 2001; Phillips and Weiss, 2001) and 
the spike of the neuroattenuated A59 strain contains a large deletion (52 
amino acids) within the HVR. As with the RBD, the long HVR of JHM.SD 
is, however, not alone sufficient to confer high neurovirulence in that 
replacement of the HVR of A59 with that of JHM.SD did not confer a 
highly neurovirulent phenotype to the virus (Phillips and Weiss, 2001). 
In addition, a spike containing SI of JHM.SD and S2 of A59 was unable to 
mediate RIS. These observations indicate that cooperation among several 
regions of spike, including RBD, the long HVR and S2, is likely required 
for the high neurovirulence conferred by the JHM.SD spike. Recent data 
suggest, however, that the long HVR is not required for high neuroviru¬ 
lence as JHM.WU has a large (approximately 400) nucleotide deletion in 
the HVR spike gene relative to that of JHM.SD (data not shown) and 
despite its inability to mediate RIS, is highly neurovirulent. 
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Finally, mutations within the HR domains, which undergo conforma¬ 
tional changes during the process of membrane fusion, also effect viru¬ 
lence and the ability to perform RIS. Most notably, substitution of amino 
acid 1114, L1114R or L1114F has been associated with neuroattenuation in 
several mutants. The spike protein of the OBLV60 mutant of JHM.SD, 
which is restricted in replication to the olfactory bulbs, contains three 
amino acid substitutions within HR1 that have been associated with the 
requirement for low pH for induction of fusion. However, the L1114R, 
alone is sufficient to confer neuroattenuation and restriction of viral 
replication to the olfactory bulbs (Gallagher et al., 1991; Tsai et al., 
2003a). An L1114F substitution has been identified both in the spike of 
the 2.2-V-l glial-tropic variant of JHM-DL (Wang et al., 1992) and in the 
spike of a highly attenuated soluble receptor-resistant mutant srr7, 
derived from JHM-cl2 (Saeki et al., 1997, 1998). This substitution is asso¬ 
ciated with an inability to induce RIS as well as with neuroattenuation 
and the restriction of infection to glial cells in the CNS (Matsuyama and 
Taguchi, 2002a,b; Taguchi and Matsuyama, 2002). It is curious that 
viruses expressing the JHM spike with a L1114F substitution have lost 
their tropism for neurons while the OBLV60 mutant, expressing a spike 
with the L1114R substitution, can readily infect neurons of the olfactory 
bulb in vivo. Thus, small changes within the HR domains, even different 
substitutions of the same residue, may result in alterations in 
spike/receptor interaction and subsequent virus entry and pathogenesis 
in vivo. 

High neurovirulence and the associated ability to carry out RIS is 
associated with less stable association of SI and S2 as compared with 
spike proteins that are CE AC AMI a dependent in order to mediate fusion, 
such that the conformational changes that lead to fusion are more easily 
triggered, even in the absence of CEACAMla (Gallagher and Buchmeier, 
2001; Krueger et al., 2001). Characterization of chimeric A59/JHM.SD 
viruses in which the SI and S2 subunits have been exchanged demon¬ 
strated that SI of JHM.SD was not alone sufficient to confer high neuro¬ 
virulence, underscoring the notion that the cooperation of many domains 
within spike are required for the full virulence. Further evidence for 
cooperation among spike domains, noncontiguous in the primary struc¬ 
ture comes from the observation that an E1035D substitution within HR1 
of S2 may overcome the Q159L substitution in the RBD, since a spike with 
both of these substitutions confers hepatotropism upon a recombinant 
A59 (Navas-Martin et al., 2005). Escape mutants selected by resistance to a 
monoclonal antibody mapping to the RBD had point mutations in the 
region of HR2 (Grosse and Siddell, 1994), providing further support for 
the interaction of these domains. Thus, the high neurovirulence conferred 
by the JHM.SD spike can be thought of as a perfect storm. Very small 
changes in the sequence can significantly reduce its virulence. 
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In an effort to understand whether the ability of MHV to perform RIS 
is truly independent of receptor or whether there is an alternative to 
CEACAMla, particularly in the brain, a tissue very poor in expression 
of CEACAMla, there have been attempts at identifying additional recep¬ 
tors. The most notable perhaps was the report that PSG16, a protein 
identified by expression from a cDNA isolated from a mouse brain 
library, when expressed in COS cells could mediate MHV entry (Chen 
et al., 1995). We recently confirmed that psgl6 mRNA is indeed expressed 
in the brain, more highly in neurons as compared to glial cells (Bender 
et al., 2010). However, the PSG16 isoform expressed by Chen et al. (1995), 
as well as other known isoforms are N-terminally truncated relative to 
other PSG family proteins (Zebhauser et ah, 2005) and thus lacked the 
sequences that interact with the MHV spike. We have recently cloned a 
novel full-length isoform of psgl6 that is also expressed in the brain, 
placenta, and retina but, like the truncated form, lacks MHV receptor 
activity when expressed on the surface of 293T cells (Phillips et al., 
submitted), suggesting that PSG16 does not mediate CEACAMla- 
independent spread of MHV. 

2. The SARS-CoV spike 

The interaction of SARS-CoV S protein with its cellular receptor, angio¬ 
tensin-converting enzyme (ACE)-2 is the major determinant of SARS-CoV 
host range. In contrast to MHV, which infects only mice, and to a limited 
extent rats, SARS-CoV isolates can infect a variety of species of animals 
other than humans, including palm civets and raccoon dogs in nature and 
in the laboratory mice and ferrets as well as nonhuman primates. The 
RBD of the SARS-CoV spike is not at the amino terminus of spike as it is 
for MHV; rather, the SARS-CoV RBD is a 192-amino acid region spanning 
residues 319-510 (Fig. 5). While the core domain of SARS-CoV spike is 
homologous to a similar region in other betacoronavirus spikes, a loop 
from residues 424-494, distinct from betacoronaviruses, is the so-called 
receptor-binding motif (RBM) that contacts ACE-2 directly. It was specu¬ 
lated that this binding loop may have been acquired from a human 
alphacoronavirus such as NL-63 which also uses ACE-2 as its receptor 
(Li et al., 2006). Comparisons of the sequences of highly pathogenic 
human isolates from the 2002-2003 SARS epidemic (e.g., TOR-2 or 
Urbana), viruses isolated from humans with milder infections in 2003- 
2004, viruses isolated from civets and raccoon dogs early in the epidemic 
and more recently bat SARS-like coronaviruses demonstrated that one or 
two amino acid substitutions in spike can have large effects on the inter¬ 
action of SARS-CoV spike with human ACE-2 receptor. Such changes 
were probably responsible for the adaptation of SARS-CoV into humans. 
The crystal structure of the SARS RBD with ACE-2 has been used to 
predict how spike variants interact with ACE-2. Two important residues. 
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within the RBM of the spikes of SARS-CoV isolates from humans during 
the 2002-2003 epidemic, that make contact with the receptor, are N479 
and T487 (Fig. 5). Most viruses isolated from palm civets encode K479, 
which is compatible with the palm civet ACE2; while the human ACE- 
2 prefers N479, the palm civet ACE-2 can equally accommodate K479 of 
the civet isolates or N479 of human isolates. Viruses isolated from more 
mild human cases of SARS in 2003-2004 encode S487 as do the palm civet 
isolates; these spikes bind less effectively to human ACE-2 than the T487 
containing spike, associated with the more pathogenic human isolates. 
These types of data have lead to the belief that the civet was the interme¬ 
diate species of transfer for the SARS-CoV from its animal reservoir into 
humans (Li et al., 2006). As discussed above, many SARS-CoV like viruses 
have been isolated from bats, leading to the belief that the reservoir for 
SARS-CoV is the bat (Lau et al., 2005; Li et al., 2005b). 

Unlike the spikes of most betacoronaviruses, the spike of SARS-CoV is 
not cleaved into SI and S2 subunits during synthesis. However, an endo- 
somal low pH requiring cleavage by cathepsin L takes place during viral 
entry, similar to that of MHV-2. The exact sites of cleavage and even the 
number of cleavage events required for viral entry and/or cell-to-cell 
fusion events have been elusive. However, it was recently reported that 
cathepsin was required for fusion during viral entry and a second leu- 
peptin-sensitive-like cleavage by a cellular protease was required for 
activation of cell to cell fusion (Simmons et al., 2011). Whittaker and 
coworkers proposed that there are two critical cleavage events, one at 
the SI/SI boundary and the other within S2 at R797 which act in concert 
to mediate membrane fusion and virus infectivity (Belouzard et al., 2009, 
2010). Two other labs reported that a transmembrane protease/serine 
subfamily member 2 (TMPRSS2) was shown to be colocalized with 
ACE2 on the cell surface and to enhance SARS-CoV entry (Matsuyama 
et al., 2010; Shulla et al., 2011). Since TMPRSS2 family proteases are found 
in the lung, these findings suggest that cleavage by this protease may be a 
determinant of viral tropism and pathogenesis during the initiation of 
SARS-CoV infection in vivo. Thus, the precise processing steps needed to 
activate the SARS-CoV are still not well understood. 

In addition to mediating virus entry, the SARS-CoV spike also has 
effects on regulation of the rennin angiotensin system, which are 
mediated by the downregulation of ACE2 expression on the plasma 
membrane, resulting from SARS-CoV infection (Inoue et al., 2007) (Haga 
et al., 2008; Rockx et al., 2009; Wang et al., 2008). The rennin-angiotensin 
system regulates blood pressure and fluid balance; this system is widely 
studied in the kidney, while little is known about regulation in the lung. 
ACE2 has been shown to be pneumoprotective in multiple models of lung 
injury, likely through its effect on degrading angiotensin II, a proinflam- 
matory mediator, synthesized by ACE-1 (Hamming et al., 2007; Imai et al., 
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2005; Kuba et al., 2005; Wosten-van Asperen et al., 2008; Zhang and Sun, 
2005). There are many known inhibitors of ACE-1 and the angiotensin II 
receptor that may have potential to ameliorate the effects of SARS-CoV 
induced lung pathology, a strategy yet to be explored. 

B. Small membrane (E) protein 

Coronavirus E proteins are small, 76-109 amino acid, integral transmem¬ 
brane proteins and are minor components of purified virus particles 
(Arbely et al., 2004; Corse and Machamer, 2000; Godet et al., 1992; Liu 
and Inglis, 1991; Raamsman et al., 2000; Yu et al., 1994). Rather than being 
expressed from a subgenomic mRNA solely dedicated to its expression as 
it is for the SARS-CoV, the E orf may be downstream of one (i.e., MHV; 
Leibowitz et al., 1988) or two (i.e., IBV; Liu et al., 1991) orfs encoding 
accessory genes that are expressed from the same mRNA as E. For IBV 
translation of the E protein, encoded in orf3c, downstream of the 3a and 
3b orfs, from subgenomic mRNA 3 has been shown to be mediated by an 
IRES that facilitates its translation (Liu and Inglis, 1992). It is not known if 
other coronaviruses also use this strategy to translate E protein from 
downstream orfs. The E protein contains three domains, a short N-termi- 
nal domain, an unusually long transmembrane domain (see below for 
discussion of topology), and a hydrophilic C-terminal domain. The C- 
terminal domain of E protein is palmitoylated (Boscarino et al., 2008; Liao 
et al., 2006; Yu et al., 1994) and ubiquitinated (Alvarez et al., 2010), and 
palmitoylation is required for proper virus assembly. 

The E protein plays an important role in assembly. Coexpression of E 
and M proteins is sufficient to direct the assembly of virus-like particles 
for most coronaviruses that have been examined (Baudoux et al., 1998; Bos 
et al., 1996; Corse and Machamer, 2000). Cross-linking experiments have 
further demonstrated an interaction between the E and M proteins (Corse 
and Machamer, 2003). This interaction appears to be largely mediated by 
their cytoplasmic tails, although there is also a role for the E protein alpha- 
helical transmembrane domain in proper assembly and release of virus 
(Ye and Hogue, 2007). Some, but not all, investigators have shown that for 
SARS-CoV, VLP assembly may require expression of N protein (Hsieh 
et al., 2005; Huang et al., 2004; Siu et al., 2008), but not E protein (Huang 
et al., 2004). Interestingly, although the E protein plays an important role 
in assembly of virus particles (Fischer et al., 1998), the E protein is not 
absolutely required for virion assembly for all coronaviruses. Kuo and 
Masters employed targeted recombination to isolate an MHV mutant that 
carried a deletion in the E gene (Kuo and Masters, 2003). This virus was 
viable, and although it produced tiny plaques and replicated to a much 
lower titer than wild-type virus, it was stable through several passages in 
cell culture. A similar result was obtained for SARS-CoV; a recombinant 
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SARS-CoV lacking the E gene was viable, reaching titers only 1-2 logs 
lower than wild-type virus in cell culture, consistent with the observation 
that E may not be required for SARS-CoV VLP formation (DeDiego et ah, 
2007). This contrasts with results obtained for TGEV, where deletion of 
the E gene was lethal (Ortego et ah, 2007). 

Only a small fraction of the intracellular pool of E protein is assembled 
into virions. The data on intracellular localization of E protein and the 
topology of E protein in membranes are conflicting. Immunofluorescence 
studies in MHV (Raamsman et ah, 2000; Yu et ah, 1994), SARS-CoV (Liao 
et ah, 2006; Nieto-Torres et ah, 2011), IBV (Corse and Machamer, 2000), and 
TGEV (Godet et ah, 1992) infected cells demonstrate that the majority of the 
E protein localizes to juxtanuclear membranes. The precise origin of these 
juxtanuclear membranes containing E appears to vary somewhat from 
virus to virus and from study to study, with E being reported to colocalize 
with Golgi markers (Corse and Machamer, 2000) and with ER markers for 
IBV (Lim and Liu, 2001); SARS-CoV E protein has been reported to coloca¬ 
lize with Golgi (Cohen et ah, 2011; Liao et ah, 2006), ER (Nal et ah, 2005), or 
ERGIC markers (Nieto-Torres et ah, 2011); MHV E protein colocalizes with 
ER and ERGIC markers (Raamsman et ah, 2000); TGEV E protein coloca¬ 
lizes with the ERGIC markers (Ortego et ah, 2007). Although most studies 
examining E protein localization did not report E as being present on 
plasma membranes but rather in an intracellular membranous compart¬ 
ment (see above), several studies reported that a small fraction of E protein 
could also be detected on plasma membranes (Godet et ah, 1992; Pervushin 
et ah, 2009; Yuan et ah, 2006a). However, a recent careful study using four 
different methods failed to detect SARS-CoV E protein at the plasma 
membrane in infected cells (Nieto-Torres et ah, 2011). Some of the differ¬ 
ences in the results obtained in different studies on the same E protein may 
be attributable to the use of N- or C-terminal tags that have the potential for 
interfering with proper targeting of the E protein when overexpressed from 
plasmids. A Golgi-targeting sequence in the C-terminal cytoplasmic tail has 
also been identified for the SARS-E protein (Cohen et ah, 2011) and a 
dilysine-like ER retention signal was identified in the C-terminal 6 amino 
acids of IBV E (Lim and Liu, 2001). The dilysine-like motif is not conserved 
in other coronaviruses, whereas the Golgi-targeting signal is conserved in 
beta and gammacoronaviruses. A chimeric protein containing the VSV 
transmembrane ectodomain and transmembrane domain fused to the E 
protein C-terminal domain was retained in the Golgi rather than trans¬ 
ported to the plasma membrane. The targeting signal, two predicted beta- 
strands flanking a conserved proline residue, was identified by mutagene¬ 
sis. This signal is conserved in the beta and gammacoronaviruses but not 
the alpha coronaviruses and is functional in both the IBV and MHV 
E proteins. The N-terminal half of the E protein appears to contain an 
additional Golgi-targeting signal (Cohen et ah, 2011). 
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A variety of E protein topologies in the membrane have been 
described for different coronaviruses; these include topologies with a 
somewhat longer than usual single transmembrane segment and those 
that describe a hairpin transmembrane domain. TGEV E has been 
reported to have its C-terminus oriented toward the lumen of intracellular 
membranes with its N-terminus exposed to the cytoplasm (Godet et al., 
1992). IBV E protein (Corse and Machamer, 2000) has been reported to 
take up the opposite orientation, with the C-terminus exposed to the 
cytoplasm and the N-terminus of the protein in the luminal position. 
The MHV E protein has been reported to have both the C-terminus 
(Maeda et al., 2001; Raamsman et al., 2000) and its N-terminus (Maeda 
et al., 2001) oriented toward the cytoplasm in a hairpin-like topology. Two 
different topologies have been reported for the SARS protein, one a 
hairpin topology with both the N- and C-termini oriented toward the 
cytoplasm (Arbely et al., 2004; Khattari et al., 2005; Yuan et al., 2006a), and 
a second topology with a single membrane-spanning domain with the C- 
terminus in the cytoplasm and the N-terminus oriented toward the lumen 
(Nieto-Torres et al., 2011; Yuan et al., 2006a). 

In addition to its important role in virus assembly, the E protein has 
several additional effects on infected cells. Overexpression of MHV (An 
et al., 1999) and SARS-CoV (Yang et al., 2005) E proteins resulted in 
apoptosis. Overexpression of Bcl-2 (An et al., 1999) or Bcl-xL (Yang et al., 
2005) inhibited MHV and SARS-CoV-induced apoptosis, respectively, 
and the SARS-CoV E protein was shown to bind Bcl-xL through a BH3- 
like region located in its C-terminal cytosolic domain. The precise mecha¬ 
nism by which E protein triggers apoptosis has not been determined. 
Caution should be observed in interpreting studies in which E protein is 
overexpressed since the level of E protein in infected cells is quite low 
relative to other coronavirus structural proteins; thus infected cells might 
not show all of the biologic effects observed in overexpression studies. 
Teoh et al. (2010) demonstrated that the SARS-E protein interacts with 
PALS1, a protein that is essential for the development and maintenance of 
epithelial tight junctions, through the C-terminal four amino acids which 
interact with the PALS1 PZD domain. This redistributes PALS1 to the 
Golgi and interferes with tight junction formation and thus may contrib¬ 
ute to the acute alveolar damage that characterizes SARS-CoV infection of 
humans. Coronavirus E proteins also have a cation-selective ion channel 
activity (Liao et al., 2004; Parthasarathy et al., 2008; Pervushin et al., 2009; 
Wilson et al., 2006, 2004). The E protein transmembrane domain forms an 
amphipathic alpha-helix which assembles into pentameric bundles in 
model lipid bilayers to form functional ion channels (Pervushin et al., 
2009; Torres et al., 2006). Although most studies of the E protein ion 
channel activity have focused on the SARS-CoV E protein, the human 
coronavirus 229E (HCoV-229E), MHV, and IBV E proteins have also been 
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shown to exhibit ion channel activity (Wilson et al., 2006). This ion channel 
activity is inhibited by hexamethylene amiloride at doses comparable to 
those which inhibit the replication of MHV and HCoV-229E (Wilson et al., 
2006), suggesting that the ion channel activity plays an important, but as 
yet unknown, role in coronavirus replication. The E protein transmem¬ 
brane domain also appears to alter the host secretory machinery to slow 
down transport of cargo proteins to the plasma membrane (Ruch and 
Machamer, 2011). It has not as yet been determined if this effect is 
mediated by the E protein ion channel activity. 

A SARS-CoV mutant that carried a deletion of the E gene (SARS- 
CoVAE) had an attenuated phenotype in several rodent models of 
SARS. (DeDiego et al., 2007, 2008; Lamirande et al., 2008; Netland et al., 
2010). After infection with SARS-CoVAE, animals were protected from a 
subsequent challenge with wild-type SARS-CoV, making SARS-CoVAE a 
potential vaccine candidate. The attenuated phenotype of SARS-CoVAE 
raises the possibility that E has a specific but as yet unknown role in 
pathogenesis. 

C. Membrane (M) protein 

The coronavirus M protein (formerly called El) is a multiple membrane- 
spanning protein containing 221-262 amino acids and it is the most abun¬ 
dant protein in the virus envelope (Cavanagh, 1983; Escors et al., 2001a; 
Godet et al., 1992). M protein consists of a short (~ 25 amino acids for MHV) 
hydrophilic glycosylated N-terminal domain that is exposed on the exter¬ 
nal surface of the virion, followed by three transmembrane domains 
followed by a long C-terminal tail that is positioned in the interior of the 
virus (Armstrong et al., 1984; Rottier et al., 1984). The C-terminal tail 
contains two domains: an amphipathic domain adjacent to the third trans¬ 
membrane domain followed by a short hydrophilic region. The N-termi¬ 
nal domain is O-glycosylated for the majority of the betacoronaviruses 
(Holmes et al., 1981; Lapps et al., 1987; Niemann and Klenk, 1981; Niemann 
et al., 1984), with the SARS-CoV (Nal et al., 2005; Oostra et al., 2006; Voss 
et al., 2006) and the MHV-2 strain of MHV (Yamada et al., 2000) being 
notable exceptions, having an N-glycosylated N-terminal domain. For 
alphacoronaviruses and gammacoronaviruses, the N-terminal domain is 
N-glycosylated (Cavanagh and Davis, 1988; Garwes et al., 1984; Jacobs 
et al., 1986; Stern and Sefton, 1982). The N-terminal domain is exposed on 
the virus surface and is protease sensitive; it is translocated to the lumen of 
the ER after in intro translation of a cDNA encoding the M protein in the 
presence of microsomes (Cavanagh and Davis, 1988; Rottier et al., 1984, 
1986). The N-terminal ectodomain can be recognized by monoclonal anti¬ 
bodies which are able to neutralize viral infectivity in the presence of 
complement (Fleming et al., 1989). The majority of the amphipathic 
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domain in the C-terminal tail is thought to be associated with the viral 
envelope or with the cytoplasmic face of the vesicular compartment where 
virus assembly and budding occurs, based upon its relative resistance to 
protease digestion after in vitro translation in the presence of microsomes 
of a cDNA encoding the M protein (Rottier et al., 1984). 

When expressed alone, M is localized to the Golgi (Krijnse-Locker 
et al., 1994; Machamer and Rose, 1987; Machamer et al., 1990; Nal et al., 
2005). The first transmembrane domain from IBV M protein appears to 
contain the signals that retain this protein in the czs-Golgi and is sufficient 
to retain otherwise plasma membrane-exposed proteins in the Golgi 
(Machamer et al., 1993; Swift and Machamer, 1991). Interestingly, for the 
MHV M protein, it appears that deletion of either of the first two trans¬ 
membrane domains, or of the cytoplasmic tail, results in failure of the M 
protein to be retained in the Golgi (Locker et al., 1994). In the infected cell, 
the M protein is localized to intracellular membranes where virus bud¬ 
ding takes place, the endoplasmic reticulum-Golgi intermediate compart¬ 
ment (ERGIC) (Krijnse-Locker et al., 1994; Tooze and Tooze, 1985; Tooze 
et al., 1984). The localization to the ERGIC in IBV infected cells appears to 
be dependent upon coexpression of the E protein (Lim and Liu, 2001). 

M protein plays a crucial role in the assembly and budding of virus 
particles (reviewed in de Haan and Rottier, 2005). M proteins interact with 
each other, and with the other virion proteins, N, E, S and in some 
betacoronaviruses, HE (de Haan et al., 1998, 1999, 2000; Escors et al., 
2001b; Fang et al., 2005; Kuo and Masters, 2002; Nguyen and Hogue, 
1997; Opstelten et al., 1995). Reverse genetic approaches (Arndt et al., 
2010; de Haan et al., 1998; Hurst et al., 2005; Kuo and Masters, 2002; 
Verma et al., 2006, 2007), experiments utilizing virus-like particles 
(VLPs) (Corse and Machamer, 2003; de Haan et al., 1998; Huang et al., 
2004; Nakauchi et al., 2008; Siu et al., 2008; Vennema et al., 1996), and 
biochemical and two-hybrid studies of protein-protein interactions (de 
Haan et al., 1999; He et al., 2004; Hsieh et al., 2008; McBride and Machamer, 
2010; Narayanan et al., 2000; Nguyen and Hogue, 1997; Opstelten et al., 
1995) have been used to study the role of M protein in the assembly of 
corona viruses. M:M interactions are mediated by the transmembrane 
domains (de Haan et al., 2000; Hida et al., 2000) and by a highly conserved 
stretch of 12 amino acids, SWWSFNPETNNL, that immediately follow the 
third transmembrane domain (Arndt et al., 2010). M:S interactions are 
largely mediated by residues in the cytoplasmic tail although there is 
evidence that residues in the N-terminal half of the molecule contribute 
to this association as well (de Haan et al., 1999; McBride and Machamer, 
2010; Voss et al., 2009). M:N interactions are also mediated by the M 
cytoplasmic domain (Escors et al., 2001b; Fang et al., 2005; Hirano and 
Ruebner, 1965; Hsieh et al., 2008; Hurst et al., 2005; Kuo and Masters, 2002; 
Verma et al., 2006, 2007). In addition to its interaction with the N protein. 
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the MHV M protein has been demonstrated to interact directly with the 
MHV RNA packaging signal in an N protein-independent manner to 
direct packaging of genomes into virions (Narayanan and Makino, 2001; 
Narayanan et al., 2003). An interaction between the IBV M protein cyto¬ 
plasmic domain and P-actin is essential for virus budding and assembly 
(Wang et al., 2009). 

A role of M protein in the induction of an interferon response to 
coronavirus infection was first demonstrated for TGEV, where the induc¬ 
tion of interferon-a in PBMCs by glutaraldehyde-fixed purified virus or 
virus-infected cells could be blocked by monoclonal antibodies that recog¬ 
nized the N-terminal exposed M protein ectodomain (Bernard and 
Hubert, 1988). Studies with VLPs containing M proteins from representa¬ 
tives of the alpha, beta, and gammacoronaviruses indicated that the 
interferogenic property of M protein was not confined to TGEV but 
extended to other coronaviruses as well (Baudoux et al., 1998). Analysis 
of the interferogenic activity of a panel of escape mutants generated with 
monoclonal antibodies directed against the N-terminal ectodomain of the 
M protein implicated N-linked glycans in this region to the induction of 
interferon by TGEV (Laude et al., 1992). Genetic studies with recombinant 
MHV-A59 mutants in which the ectodomain of the MHV M protein had 
been altered to either abolish glycosylation, or to replace the normal 
O-glycosylation sites with N-glycosylation sites also affected their inter¬ 
ferogenic capacity in that viruses-containing N-glycosylated M protein 
were better inducers of interferon than those containing O-glycosylated 
M protein (de Haan et al., 2003). Mutants with unglycosylated M proteins 
were poor interferon inducers, In vivo challenge with these viruses 
demonstrated that their abilities to replicate in the liver, but not brain, 
correlated with their in vitro interferogenic capacity (de Haan et al., 2003). 
This correlation may be the result of binding of virus to lectins, such as the 
mannose receptor, which are abundantly, expressed in the liver but also 
play a role in the induction of interferon-a in dendritic cells. In contrast, 
overexpressed SARS M protein associated with RIG-I, TBK1, IKKs, and 
TRAF3, thus inhibiting activation of IRF3 and IRF7, leading to significant 
suppression of the induction of the interferon-p promoter by dsRNA (Siu 
et al., 2009). It is not clear if this difference in the effects of M protein on the 
induction (or inhibition) of an interferon response reflects real differences 
between the different coronaviruses used in these experiments or rather 
differences between virus infection and overexpression studies. 

D. Hemagglutinin-esterase (HE) 

HE forms a second, smaller spike on the envelope of some betacorona- 
viruses (Kienzle et al., 1990; Smits et al., 2005; Yokomori et al., 1991,1989). 
HE is synthesized as a 42 kDa polypeptide, glycosylated to 65 kDa and 
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disulfide linked to form a homodimer. The MHV HE was observed to 
have 30% sequence homology to the HA1 subunit of the hemagglutinin 
esterase fusion (HEF) protein of influenza C virus (Luytjes et al., 1988), 
leading to the speculation that HE was obtained via nonhomologous RNA 
recombination involving a betacoronavirus after the split off of SARS- 
CoV, which does not encode an HE protein (Snijder et al. r 2003). HE 
protein has sialic acid binding and acetyl esterase (or receptor destroying) 
activities (Brian et al., 1995; Kienzle et al., 1990), which could potentially 
contribute to viral entry and/or release from the cell surface via interac¬ 
tion with sialic acid-containing moieties. HE is an essential protein for 
viruses within the betacoronavirus-1 species, including bovine coronavi- 
rus (BCoV) and HCoV-OC43 (Kienzle et al., 1990; Vlasak et al., 1988a,b). In 
contrast, for MHV, HE is a nonessential protein expressed by some JHM 
isolates in addition to MHV-S and the enteropathogenic strains, including 
MHV-DVIM (Yokomori et al., 1991). It has been speculated that, for MHV, 
HE may function as an initial or additional binding molecule while spike 
mediates binding to a specific glycoprotein receptor on the cell surface, in 
addition to supplying receptor destroying activity to remove the virus 
when attached to nonsusceptible cells or aiding in virus release. More 
recently, it was demonstrated that for the betacoronavirus-1 species, it is 
the spike protein that binds to sialic acid residues which implies that the 
major function of HE, for these viruses, would be release from glycans via 
esterase activity (Wurzer et al., 2002). Thus, for these coronaviruses, it has 
been speculated that during evolution, the spike protein extended its 
receptor specificity or even shifted its specificity to glycan residues 
(Langereis et al., 2010). Consistent with this, a specific protein receptor 
for this species of coronaviruses has not been identified. An alternative 
evolutionary history in which the S protein of an ancestor of all of the 
coronavirus genera had a hemaggulinating (lectin) activity that was lost 
in the MuCoVs sometime after the acquisition of HE from influenza C is 
also possible (Fig. 6). 

A role for HE in infection and/or pathogenesis has not yet been 
defined. While HE is clearly dispensable during replication in cell culture, 
it is conserved among enteropathogenic field strains, suggesting an 
important function in the wild. The genome of the tissue culture-adapted 
A59 strain has multiple mutations in the HE orf as well as in the tran¬ 
scriptional regulatory sequences (TRS), resulting in an inactive HE pseu¬ 
dogene and HE (of strain MHV-S) expression from a chimeric A59/MHV- 
S was lost during passage in vitro. It had long been speculated that HE 
may play a role in acute and/or chronic MHV disease, either as a deter¬ 
minant of organ and/or cellular tropism (Yokomori et al., 1995, 1992, 
1991) or to aid in spread of the virus (Kienzle et al., 1990; Smits et al., 
2005). More recently, a role for HE in spread of MHV in the CNS was 
demonstrated by comparison of isogenic recombinant viruses expressing 
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Ancesteral coronavirus (S+) 



MH V-DVI M(S-/H E+) MHV-S(S-/HE + *) 


FIGURE 6 A speculative evolutionary history of the lectin activity of HE and S proteins. 
S proteins that maintain the ability to hemaggultinate and bind to glycans are denoted 
S+; those that have lost this ability as S—. For SARS-CoV, hemagglutinating activity has 
not been reported and to our knowledge has not been tested for; this is denoted as S? 
HE proteins with a requirement or a strong preference for 9-O-acetyl sialic acid as their 
ligand (and esterase substrate) are depicted as HE+. HE proteins from strains of MHV 
where the preferred ligand (substrate) of HE is 4-O-acetyl sialic acid are depicted as 
HE + *. This schema is supported by the presence of hemagglutinin (lectin-like) activity 
of TGEV (an alphacoronavirus) (Krempl et al., 2000) and IBV (a gammacoronavirus) 
(Niesters et al., 1987), and the presence of a galectin fold in the receptor-binding domain 
of MHV (Peng et al., 2011). 


a wild-type HE protein, an HE protein in which the esterase activity had 
been eliminated, and a vims expressing a truncated HE polypeptide (Kazi 
et al., 2005). The viruses that expressed full-length HE polypeptides (with 
or without a functional esterase activity) were more virulent when inocu¬ 
lated intracranially into mice and spread more extensively in the CNS 
compared to viruses expressing a truncated HE polypeptide. Thus, per¬ 
haps surprisingly, enhanced virulence does not require an intact esterase 
activity, suggesting that HE may instead enhance virus attachment and / 
or spread via binding to sialic acid-containing molecules. Since expres¬ 
sion of the MHV receptor CE AC AMI a is relatively low in the brain, we 
speculate that HE interaction with cell surface molecules may enhance 
attachment to one or more neural cell types. However, isogenic recombi¬ 
nant JHM strains that differ only in expression of HE showed no differ¬ 
ences in neurovirulence, suggesting that in this context HE had no major 
effects on neuropathogenesis. A caveat to this result is that expression of 
HE was at a considerably lower level than that in the chimeric viruses 
expressing the HE protein of the MHV-S strain (T.J. Cowley and Weiss, 
data not shown). The MHV esterase activity may be more important in 
other organs such as the gastrointestinal tract where the virus may need to 
pass through mucous or have the ability to detach cells that may not be 
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productively infected, both believed to be functions of neuraminidases. In 
the case of the influenza neuraminidase, the sialic acid specificity of the 
enzymatic activity was shown to determine the cell subtype infected 
within the respiratory track and hence the pathogenic outcome 
(Matrosovich et al., 2004). 

E. Nucleocapsid protein (N) and Internal (I) proteins 

The nucleocapsid protein (N), a basic RNA-binding protein (Armstrong 
et al., 1983) encoded in the most 3' portion of the MHV genome, plays both 
structural and nonstructural roles in infection. N complexes with genome 
RNA to form the viral capsid (Sturman et al., 1980) and interacts with the 
viral membrane protein (M) during assembly (Hurst et al., 2005) as 
described above. In addition, N associates with genomic and subgenomic 
messenger RNA, binding specifically to the TRS (Baric et al., 1988; 
Grossoehme et al., 2009) and significantly enhances recovery of infectious 
virus from transfected genome length synthetic RNA (Yount et al., 2002). 
It was recently demonstrated that N protein is associated with replication- 
transcription complexes in infected cells and that recruitment of N to 
these complexes requires the C-terminal N2b domain, which interacts 
with other N proteins (Verheije et al., 2010). Interestingly, N is unique 
among MHV structural proteins in that it is partially localized to the 
nucleus of infected cells (Wurm et al., 2001). 

There are data suggesting that N has several potential roles in patho¬ 
genesis of CNS disease as well as in MHV-induced hepatitis. Analysis of 
chimeric viruses in which the N genes of A59 and JHM.SD have been 
exchanged demonstrated that expression of the JHM N protein from 
within the A59 background confers increased neurovirulence character¬ 
ized by increased spread of viral antigen in the brain compared to A59 
(Cowley et al., 2010). In addition, N has been reported to associate with 
microtubules (Pasick et al., 1994), suggesting a possible role in trafficking 
and axonal transport in neurons; however, there are no further reports 
following up these intriguing data and in the context of JHM/A59 chi¬ 
meras N of strain JHM.SD did not enhance spread in cultures of primary 
hippocampal neurons (Cowley et al., 2010). 

N has been reported to antagonize type I interferon by blocking RNase 
L activity; this activity was demonstrated in 17C1-1 cells, a cell type in 
which MHV does not induce type IIFN and may be due to the ability of N 
to bind RNA and thus sequester it from detection by pattern recognition 
receptors and the consequent induction of type I interferon (Ye et al., 
2007). Furthermore, N proteins from hepatotropic MHV-3 and A59, but 
not JHM, were shown to be responsible for the induction of fibrinogen¬ 
like protein 2 (fgl2), a multifunctional protein that has both procoagulant 
and immunosuppressive activities and leads to enhanced liver damage 
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during MHV infection (Ding et al., 1998; McGilvray et al., 1998; Ning et al., 
1999). The N protein of SARS N was also reported to be an interferon 
antagonist. When overexpressed in 293T cells, N protein inhibited 
production of IFN by inhibiting activation of IRF-3 and also inhibited 
activation of expression from an NF-icB-responsive promoter (Kopecky- 
Bromberg et al., 2007). These experiments were carried out with over¬ 
expressed N, which may have acted through its RNA-binding properties. 

The internal protein (I) is a 23 kDa hydrophobic viral membrane- 
associated structural protein of unknown function. The I gene is encoded 
within the +1 reading frame of the N orf (Fischer et al., 199 7). An I gene 
negative recombinant A59 virus displayed no major differences in repli¬ 
cation in vitro or in vivo in the brain or liver compared to its isogenic 
control wild type. Interestingly, all MHV strains express an I gene with 
the notable exception of JHM (Fischer et al., 1997; Parker and Masters, 
1990). Thus, the lack of expression of the I protein could possibly be 
responsible for the inability of JHM to induce hepatitis. However, this is 
unlikely as expression of the I gene from within the JHM genome was not 
sufficient to confer the ability to induce hepatitis (Cowley et al., 2010). 


VI. REPLICASE PROTEINS 

The coronavirus replicase locus is expressed via ppla and pplab, which 
are processed into 16 nonstructural proteins, some of which provide 
essential functions such as the RNA-dependent RNA polymerases, the 
enzymes that modify the 5' end of the genome with a methylated cap 
structure and two or three proteases that process precursor proteins. 
Other replicase proteins provide nonessential functions in virus-host 
interaction. We discuss these proteins below. 

A. Nspl2 polymerase and Nsp8 primase 

Nspl2 is encoded in orf lb, just downstream of and containing the orfla / 
lb translational frame shift region. Nspl2 contains the RNA-dependent 
RNA polymerase (RdRp) core activity that is responsible for replication of 
the viral genome via a negative strand intermediate as well as carrying 
out transcription of the multiple subgenomic mRNAs containing 5' ter¬ 
mini derived from the 5' end of the genome, also via negative strand 
intermediates. Based on structures that have previously characterized 
RdRps, Xu et al. (2003) built a three-dimensional model of the catalytic 
domain and located conserved motifs that are shared by all RdRps. There 
has been little characterization of this activity as a result of difficulties of 
expression of the proteins. However, recently nspl2 was expressed in 
E. coli with its natural N-terminus and was shown to have primer 
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dependent activity in vitro on RNA substrates similar to enzymes from 
poliovirus and Hepatitis C (te Velthuis et al., 2010). Nsp8 has RdRp 
activity that prefers internal 5'-(G/U)CC-3' sequences to initiate synthesis 
of oligonucelotides of less than six residues. In addition, the C-terminus of 
nsp8 has homology with the catalytic palm domain of RNA viral poly¬ 
merases. These data lead to the suggestion that nsp8 may serve as a 
primase to synthesize primers for nspl2 dependent coronavirus RNA 
synthesis (Imbert et al., 2006). There are no data regarding how the 
polymerase mediates the joining of noncontiguous sequences in the gen¬ 
esis of negative stranded subgenomic RNAs that serve as templates for 
mRNAs or which, if any, replicase proteins participate in these processes. 

B. Nspl3 helicase 

Nspl3 is a 66 kDa protein containing an N-terminal zinc finger structure 
linked to a C-terminal superfamily 1 helicase domain. A histidine-tagged 
form of the alphacoronavirus 229E protein was expressed using baculo- 
virus vectors in insect cells (Ivanov and Ziebuhr, 2004) and a SARS-CoV 
nspl3-maltose binding protein (MBP)-fusion was expressed in E. coli 
(Ivanov et al., 2004b). The purified recombinant proteins had both RNA 
and DNA duplex 5'-to-3' unwinding activities. This is the opposite direc¬ 
tion from the well-characterized flaviviral helicases and it has been specu¬ 
lated that this may reflect the synthesis of the multiple subgenomic 
mRNAs and their negative polarity templates. More recently, Sarafianos 
and colleagues (personal communication) expressed a glutathione 
S-transferase-tagged SARS-CoV nspl3 (GST-nspl3) via recombinant 
baculovirus in insect cells and demonstrated that it unwinds nucleic 
acids at rates comparable to other helicases, two to three orders of magni¬ 
tude faster than His tagged or MBP-fusion nspl3 proteins. These data also 
demonstrated that nspl2 complexes with nspl3 and the complex 
unwinds nucleic acids twice as fast as nspl3 alone. Nspl3 also has 
NTPase, dNTPase and 5' triphosphatase activities. The association of 
triphosphatase activity with the helicase has led to the suggestion that it 
may carry out the first step in the capping of genome and mRNAs. 

C. Nspl protein 

Coronavirus nspl protein is the N-terminal protein in the orfla polypro¬ 
tein (ppla), and is cotranslationally cleaved from ppla by a papain-like 
protease (PLP), also contained in orfla (nsp3) (Baker et al., 1989; Bonilla 
et al., 1995; Denison and Perlman, 1987; Denison et al., 1992; Dong and 
Baker, 1994; Hughes et al., 1995; Perlman, 1986; Soe et al., 1987). In viruses 
with two PLP domains, PLP-1 carries out this cleavage; in viruses that 
have only a single PLP domain, that domain catalyzes the proteolytic 
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cleavage of nspl from the ppla precursor. The proteolytic cleavage of this 
protein from its precursor produces nspls of varying sizes depending 
upon the coronavirus genus. While nspls of most of the betacorona- 
viruses are about 245 amino acids long (also called p28), the SARS-CoV 
nspl is 179 amino acids long. Nspls from alphacoronaviruses are shorter, 
about 110 amino acids in length. The sequences of nspls from different 
coronaviruses are highly divergent with a very low level of sequence 
similarity when comparing members of different genera. Only 20% 
sequence similarity can be detected between the MHV and SARS-CoV 
nspls, and even within the less divergent alphacoronaviruses only 
20-50% sequence identity is found in pairwise comparisons (Almeida 
et al., 2007). The avian gammacoronaviruses do not have an nspl homo- 
logue (Almeida et al., 2007). For the SARS-CoV nspl, NMR studies of 
recombinant nspl has shown that residues 13-128 contain a novel alpha/ 
beta fold formed by a six stranded beta barrel with an alpha-helix cover¬ 
ing one end of the barrel and another helix alongside the barrel (Almeida 
et al., 2007). No biochemical activity could be ascribed to this protein from 
this structure. Molecular modeling of HCoV-229E and HCoV-NL63 nspl 
(two alphacoronaviruses) suggests that this beta barrel structure is also 
found in the alphacoronavirus nspl (Wang et al., 2010). These authors also 
found evidence for functional conservation of nspl functional activities 
among these three viruses (see below). 

Immunofluorescence studies have shown that nspl colocalizes with 
proteins found in replication complexes during early times after infection, 
but at later times, the protein colocalizes with the M protein at the site of 
virus budding and assembly (Brockway et al., 2004). Biochemical fraction¬ 
ation studies of infected cells suggest that MHV p28 is also present in the 
soluble fraction of the cytosol at late times postinfection (Bi et al., 1998). 
Yeast two-hybrid and coimmunoprecipitation experiments demonstrated 
potential interactions with nsp7 (P10) and nsplO (P15) (Brockway et al., 
2004). Reverse genetic experiments to assess the function of nspl demon¬ 
strated that for MHV, deletion of the C-terminal half (residues 124-245) of 
nspl was tolerated and gave rise to viable virus, whereas deletion of the 
N-terminal half of the protein was lethal (Brockway and Denison, 2005). 
Two clustered changed-to-alanine mutants (R64A/E69A and R78A/ 
D79A) were also lethal, although other changed-to-alanine mutations in 
the N-terminal half of the molecule were viable. The interpretation of 
these studies is complicated by the presence of cis-acting replication sig¬ 
nals in the MHV RNA genome that extend from the 5'UTR into the first 
half of the nspl coding region (Brown et al., 2007; Kim et al., 1993), raising 
the possibility that the lethality observed was due to changes made to 
these cis-acting sequences. Brian and coworkers have demonstrated that 
purified recombinant BCoV nspl binds in vitro to several cis-acting stem- 
loop structures present in BoCV 5' and 3' UTRs (Gustin et al., 2009). 
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Furthermore, expression of nspl from a DI RNA-encoded subgenomic 
mRNA resulted in a reduction in the replication of the DI RNA relative to a 
control DI construct, but there was only a slight transient reduction in 
helper virus RNA replication. These results suggest that nspl is an RNA- 
binding protein that may function to regulate viral genome translation or 
replication. 

Infection with MHV leads to cell cycle arrest in the G0/G1 phase 
associated with a reduction in the amounts of G1 cyclin-Cdk complexes, 
active Cdk and insufficient phosphorylation of retinoblastoma protein 
(pRb) (Chen and Makino, 2004). The expression of the MHV nspl protein 
from plasmid vectors in uninfected cells resulted in a similar cell cycle 
arrest in G0/G1 and inhibition of cell proliferation, suggesting that this 
effect is due at least in part to nspl (Chen et al., 2004). Examination of cell 
cycle regulatory proteins demonstrated that p28 expression resulted in 
hypophosphorylation of pRb, increased levels of tumor suppressor p53 
and the cyclin-dependent kinase (Cdk) inhibitor p21 Cipl . This suggests 
that p28 expression stabilizes p53, either directly or indirectly, and that 
accumulated p53 causes upregulation of p21 Clpl with subsequent inhibi¬ 
tion of cyclin E/Cdk2 activity, resulting in the inhibition of pRb phos¬ 
phorylation and thus cell cycle arrest in G0/G1. It should be noted that 
another MHV protein, nspl5, also interacts with Rb and thus also influ¬ 
ences the cell cycle (see nspl5 below). Exogenous expression of SARS- 
CoV nspl also results in decreased cell proliferation with an accumulation 
of cells in the G0/G1 phase of the cell cycle (Wathelet et ah, 2007), 
suggesting that this effect on the cell cycle may be exerted by many 
coronavirus nspl proteins. 

Work by the Makino and Baric labs suggests that nspl has a role in 
pathogenesis by blocking both the synthesis of type I interferons in SARS- 
CoV infected cells and the induction of interferon-dependent anti-viral 
proteins such as ISG15 and ISG56 (Kamitani et ah, 2006; Narayanan et al., 
2008a; Wathelet et ah, 2007). There is conflicting data concerning the 
mechanism(s) by which SARS-CoV nspl inhibits the interferon response. 
Wathelet et al. reported that nspl expression inhibits the activation of 
IRF3, NF-KB, and c-Jun, three transcription factors that are required for 
activation of the interferon-P promoter. They also report that nspl expres¬ 
sion blunts the phosphorylation of ST ATI in response to type 1 interferon. 
In contrast, Kamitani et al. failed to detect an effect of nspl expression on 
IRF3 activation using IRF3 dimerization as a readout for activation 
(Kamitani et ah, 2006). Their data demonstrates that exogenously 
expressed SARS-CoV nspl increases the rate of mRNA degradation, 
thus inhibiting the accumulation of interferon-p mRNA (and other 
mRNAs as well) and protein that is normally observed after infection by 
Sendai virus. Further biochemical studies demonstrated that recombinant 
nspl inhibited in vitro translation reactions, bound to the 40S ribosomal 
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subunit, inhibiting 80S ribosome formation but permitting the formation 
of a ternary 48S complex with mRNA (Kamitani et ah, 2009). Primer 
extension analysis of the mRNA in the ternary complex indicated that 
nspl resulted in a modification of the 5' region of the mRNA, rendering it 
translationally incompetent. Interestingly, Wathelet et al. found that 
expression of nspl had variable effects on protein synthesis depending 
upon the protein examined, although it should be noted that the overall 
protein content of cells transfected with nspl expressing plasmids was 
significantly lower than that in cells transfected with control plasmid 
suggesting that overall protein synthesis (or degradation) may be affected 
(Wathelet et ah, 2007). The differences in the mechanisms by which nspl 
blocks the induction of interferon by virus infection reported by the two 
groups may be due to differences in the cell lines employed by the two 
groups, as well as differences in the methodologies they employed. 
It should be emphasized that both groups are in agreement that nspl 
antagonizes the induction of an interferon response. 

Both groups have employed reverse genetic approaches to examine 
the role of nspl in the context of SARS-CoV infection. Mutagenesis 
experiments demonstrated that two basic residues (K164 and H165) 
were crucial for stimulating the degradation of mRNA and inhibition of 
host protein synthesis in cells transfected with nspl expressing plasmids 
(Narayanan et ah, 2008a). A recombinant SARS-CoV encoding a mutant 
nspl (K164A/H165A) grew as well as the wild-type virus but was not as 
efficient as wild-type SARS-CoV in promoting host mRNA degradation 
and in inhibiting host cell protein synthesis. In contrast to the minimal 
amounts of type I interferon that were observed after infection with wild- 
type SARS-CoV, infection of cells with the SARS-CoV nspl mutant 
resulted in the induction of type I interferons at levels similar to those 
observed after infection with Sendai virus, a strong inducer of interferon. 
Wathelet et ah employed two different mutations (R124S/K125E and 
N128S/K129E) that rendered nspl much less effective than wild-type 
nspl in blocking the induction of the interferon-p promoter and of inter¬ 
feron responsive genes (Wathelet et ah, 2007). A recombinant SARS-CoV 
encoding the nspl R124S/K125E mutation replicated as efficiently as 
wild-type virus in cells with a defective interferon response but its repli¬ 
cation was significantly decreased relative to wild virus in cells with an 
intact interferon response. The results from both groups suggest that the 
introduction of mutations into nspl might be effective in attenuating the 
virulence of SARS-CoV by increasing its sensitivity to interferon. 

Studies with nspls from bat coronaviruses belonging to the betacor- 
onavirus genus showed that they were also able to inhibit the induction of 
interferon and interferon responsive genes (Tohya et ah, 2009), suggesting 
that the activity of nspl in blocking the development of an interferon- 
mediated antiviral state in the infected cells is not unique to the 
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SARS-CoV. The alphacoronavirus TGEV nspl, like the SARS-CoV nspl, 
also inhibits host protein synthesis. The mechanism by which TGEV nspl 
accomplishes this may differ between the two viruses, since TGEV nspl 
did not bind to 40S ribosomal subunits, nor did it promote host mRNA 
degradation (Huang et ah, 2011). Thus it appears that nspl from diverse 
coronaviruses may have some common functions in spite of their 
sequence divergence. 

D. Nsp3 protein 

Nsp3 is a 180-200 kDa multifunctional protein, encoded within coronavi- 
rus orfla, containing multiple functional domains, of which the two most 
characterized are a PLP domain, described above, and a macrodomain 
(ADP-ribose l"-phosphatase or ADRP), also referred to as the "X" 
domain, both of which have been shown to be virulence factors for 
MHV and will be discussed. Interestingly. Neuman et al. (2008) using 
mass spectrometry and kinase-profiling techniques identified nsp3 as a 
virion component and based on both bioinformatics analyses and charac¬ 
terization of E. coli expressed proteins, identified two additional RNA- 
binding domains, a chaperone-like domain encoded immediately down¬ 
stream of the PLP domain and a cysteine-coordinated metal ion-binding 
domain (Fig. 7). The PLP of SARS-CoV and the analogous PLP-2 of MHV 
have deubiquitinating activity in addition to protease activity, and it has 
been suggested that this activity could confer type I IFN antagonism 
(Barretto et ah, 2005; Zheng et ah, 2008). Indeed, several studies have 
demonstrated that PLP of SARS-CoV inhibits both the IRF3 and NF-kB 
pathways (Devaraj et ah, 2007; Frieman et ah, 2009). However, there are 
conflicting data regarding the role of MHV PLP-2 as a type I IFN antago¬ 
nist (Frieman et ah, 2009; Zheng et ah, 2008), and it is possible that the 
MHV- and SARS-CoV-encoded proteases may differ in this activity. 

Macrodomains are ubiquitous and highly conserved among many 
viral groups and throughout all eukaryotic organisms, bacteria, and 
archae. The best characterized macro domain is the histone-associated 
MacroH2A, which plays a role in cell type-specific regulation of 

HUB1 - PLP1 HADRPH MBP ||UB2|| PLP2 HNAB|G2MH f(ZFH- ^llY2|Y3| — 

FIGURE 7 A schematic diagram of the domain structure of MHV nsp3. The domains 
depicted are: ubiquitin-related domains (UB1 and UB2), papain-like proteases (PLP-1 and 
PLP-2), ADP-ribose l"-phosphatase (ADRP), metal binding domain (MBD), betacorona- 
virus-specific nucleic acid binding (NAB) and marker (G2M) domains, a putative metal¬ 
binding region containing a zinc finger (ZF), and three subdomains forming part of the Y 
region (Y1-Y3). Transmembrane domains are depicted by vertical bars. Based on Neuman 
et al., 2008. 
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transcription (Changolkar et al., 2008). The ADRPs of several coronaviruses 
(including SARS-CoV, HCoV-229E, and porcine TGEV) were demon¬ 
strated to have phosphatase activity, converting ADP-ribose 1" phosphate, 
into ADP-ribose and inorganic phosphate (Putics et al., 2005, 2006). Muta¬ 
tion of a conserved residue within the MHV ADRP domain resulting in loss 
of enzymatic activity conferred loss of hepatitis and decreases in inflam¬ 
matory cytokine induction (Eriksson et al., 2008). More recently, it was 
concluded that the ADRP domains of SARS-CoV and 229E conferred 
resistance to type I IFN treatment (Kuri et al., 2011). Because MHV, as 
well as some other group II coronaviruses, encode a putative cyclopho¬ 
sphodiesterase activity within the ns2 protein, encoded in ORF2a, just 
downstream of the replicase gene (Fig. 1), it was predicted that together, 
the CPD and ADRP activities could participate in a pathway of nucleotide 
processing (Snijder et al., 2003) in which the CPD would convert ADP- 
ribose-1", 2"-cyclic phosphate into ADP-ribose 1" phosphate and the ADRP 
would convert the product of the CPD, ADP-ribose 1" phosphate, into 
ADP-ribose and inorganic phosphate (Putics et al., 2005, 2006). It seems 
unlikely now that this pathway is utilized during coronavirus infection in 
that most coronaviruses do not express ns2 and in addition there are data 
indicating that MHV ns2 acts as a virulence factor via 2', 5' phosphodies¬ 
terase activity by inhibition of RNase L pathway, not through a CPD 
activity (L. Zhao and S.R. Weiss, unpublished data described further 
below). In addition, the ADRP domains of SARS-CoV, hepatitis E virus 
and Sindbis virus were shown to have only weak enzymatic activity. 
However, these macrodomains also have binding activity to mono- and 
poly-ADP-ribose, implying that they may participate in ribosylation of host 
cell proteins, which may promote apoptosis or necrosis and interfere with 
numerous host pathways (Egloff et al., 2006). Finally, studies of the crystal 
structure of SARS-CoV and the gammacoronavirus IBV ADRP domains 
indicated that the ADRP of IBV fails to bind ADP-ribose implying that 
ADRP functions may differ among viruses (Piotrowski et al., 2009). Thus, 
the function of the ADRP during infection in vivo is still an open question. 

E. Nspl4 protein 

The nspl4 protein is encoded in the coronavirus lb orf and is synthesized 
as pplab precursor from which the mature nspl4 protein of approxi¬ 
mately 60 kDa is proteolytically released by the CoV 3CL pro protease 
(Hegyi and Ziebuhr, 2002; Prentice et al., 2004; Xu et al., 2001) [reviewed 
in (Ziebuhr, 2005)]. The N-terminal half of nspl4 contains a domain 
predicted by a comparative genomics approach to be a member of a 30- 
to-50 member exonuclease (ExoN) family belonging to the DEDD super¬ 
family of proteins (Snijder et al., 2003). It was speculated that this putative 
enzymatic activity could be part of a nucleic acid (RNA) modification 
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pathway also involving nspl5 (NendoU, a predicted endonuclease) and 
nspl6 (a predicted 2'-0-methyltransferase) (Snijder et al., 2003). A bio¬ 
chemical and genetic (complementation) analysis of a panel of MHV-A59 
temperature-sensitive (ts) mutants identified two ts mutants in the nspl4 
gene which were unable to carry out RNA synthesis at the nonpermissive 
temperature, implying an essential role for nspl4 in these processes 
(Sawicki et al., 2005). The predicted 3' —> 5' exoribonuclease activity of 
nspl4 has been characterized biochemically, including identification of 
residues required for enzyme activity (Chen et al., 2007; Minskaia et al., 
2006). The introduction of putative active site mutations into the ExoN 
coding sequence in the HCoV-229E genome by reverse genetics was lethal 
(Minskaia et al., 2006). These mutations had a major effect on viral RNA 
synthesis, greatly decreasing the amount of virus-specific RNA in cells 
transfected by mutant genomes relative to those observed with wild-type 
genomes and significantly altered the amounts and the electrophoretic 
mobility of the HCoV-229E subgenomic mRNA, indicating that the exo¬ 
nuclease activity of nspl4 has an important role in both transcription and 
RNA replication. The introduction of similar mutations into both the 
SARS-CoV and MHV genomes had a much less dramatic effect on repli¬ 
cation, and viable viruses were recovered, although they replicated more 
poorly than the wild-type viruses (Eckerle et al., 2007, 2010). 

The sequence relationships between the CoV ExoN domain and DNA 
polymerase-associated 3' —> 5' exonuclease domains and its greater activity 
with dsRNA substrates than with single stranded RNAs, suggests a possible 
role of nspl4 in proofreading during CoV RNA synthesis (Minskaia et al., 
2006; Snijder et al., 2003). Consistent with this hypothesis, for MHV and 
SARS-CoV, alanine replacement of conserved ExoN active-site residues 
yielded viable mutant viruses that accumulated 15-21 fold more mutations 
than wild-type virus during passage (Eckerle et al., 2010, 2007). The esti¬ 
mated mutation rate for these ExoN mutants was similar to that reported for 
other RNA viruses, approximately 1-3 x 10 _ 5 substitutions per nucleotide 
per replication cycle, whereas that of the wild-type viruses was about 10- 
fold less, suggesting that ExoN contributes to an unusually high fidelity of 
RNA synthesis for coronaviruses. Recently, a yeast genetic functional screen 
for the cap-forming enzymes encoded by SARS-CoV identified a second 
RNA modification activity in nspl4, a (guanine-N7)-methyltransferase (N7- 
MTase) (Chen et al., 2009). This N7-MTase activity mapped to the C-terminal 
half of nspl4. Functional studies of mutants introduced into nspl4 in a 
replicon system showed that the N7-MTase activity is important for 
SARS-CoV replication and transcription (Almazan et al., 2006; Chen et al., 
2009). Interestingly, a mammalian two-hybrid screen for interactions among 
the SARS-CoV nsps indicated that nspl4 interacted with nsplO, a protein 
which also interacted with nspl6 (Bouvet et al., 2010; Pan et al., 2008). The 
interaction of nspl4 with nsplO had little effect on nspl4 N7-MTase activity 
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(Bouvet et al., 2010). An earlier yeast two-hybrid screen revealed additional 
interactions: nspl4 with nsp8 (a putative RdRp primase) and with the 
SARS-CoV 9a accessory protein (von Brunn et al., 2007). A screen directed 
toward identifying cellular interacting partners identified an interaction 
with a cellular protein, DDX1, an RNA helicase in the DExD/H helicase 
family (Xu et al., 2010). siRNA knockdown of DDXI modestly decreased 
viral replication suggesting that this interaction makes a contribution to 
efficient coronavirus replication. 

Nspl4 also may have one or more pathogenesis-related activities. 
Sperry et al. identified a Y414H mutation within the MHV-A59 nspl4 
N7-MTase domain that resulted in the complete attenuation of lethal 
disease after intracranial challenge with this virus while having no effect 
on replication in cell culture (Sperry et al., 2005). This mutation is located 
in a predicted (3-strand that does not contain putative active-site residues 
for the N7-MTase activity (Minskaia et al., 2006). The mechanism for 
attenuation is unknown, nor is the effect, if any, of this mutation on the 
N7-MTase and ExoN activities of nspl4. 

F. Nspl5 protein 

The nspl5 protein is encoded in the coronavirus lb orf and is synthesized 
as part of the pplab precursor polyprotein from which the mature nspl5 
protein of approximately 38 kDa is proteolytically released by the 3CL pro 
protease (Hegyi and Ziebuhr, 2002; Prentice et al., 2004; Xu et al., 2001) 
(reviewed in Ziebuhr, 2005). Nspl5 contains a domain (NendoU) pre¬ 
dicted to be related to a Xenopus laevis U specific endonuclease (XendoU) 
that functions in small nucleolar RNA processing (Snijder et al., 2003). 
Among RNA viruses, this domain is unique to nidoviruses and is present 
in members of the arterivirus family as well as in the coronavirus family. 
Purified recombinant nspl5 from four different coronaviruses represent¬ 
ing the three coronavirus genera exhibited an endoribonuclease activity 
that required divalent cations with a strong preference for Mn 1 1 and 
preferentially cleaved at uridine nucleotides in both single- and double- 
stranded RNA substrates leaving a 2'-3' cyclic phosphate end (Bhardwaj 
et al., 2004; Ivanov et al., 2004a). The enzyme cleaves 3' of the recognition 
uridylate residues and although it cleaves preferentially at U, it also has a 
much slower reactivity with C containing substrates (Bhardwaj et al., 
2006). A 2'-0-methylated RNA substrate was reported to be resistant to 
cleavage by nspl5 (Ivanov et al., 2004a), suggesting a possible role for the 
2'-OMT activity of nspl6 (see below) in regulating nspl5 activity. Purified 
recombinant nspl5 exists in equilibrium between monomeric and hex- 
americ forms, but only the hexameric form is enzymatically active 
(Guarino et al., 2005) and binds RNA (Bhardwaj et al., 2006). Cryoelectron 
microscopy studies and subsequent crystallographic determination of the 
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SARS-CoV (Bhardwaj et al., 2006, 2008; Ricagno et al., 2006) and MHV 
nspl5 structures confirmed the hexameric structure for nspl5 and 
demonstrated that the hexamer was a dimer of trimers (Xu et al., 2006). 
Mutagenesis studies based on the alignment of coronavirus nspl5s with 
the Xenopus XendoU sequence suggested that the catalytic center 
contained two histidines and a lysine that were completely conserved 
and were essential for enzyme activity (Guarino et al., 2005; Ivanov et al., 
2004a). The determination of the nspl5 structure confirmed this assign¬ 
ment and further determined that these catalytic residues were arranged 
in space virtually identically to the catalytic residues of RNase A, suggest¬ 
ing a similar mechanism of action for that enzyme (Bhardwaj et al., 2008; 
Ricagno et al., 2006; Xu et al., 2006). Each monomer in the hexamer has a 
three-domain structure with a small N-terminal domain followed by two 
larger domains and contains an enzymatically active site. Determination 
of the structure of a monomeric form of nspl5 showed how oligomeriza¬ 
tion stabilized the formation of the active site and provided an explana¬ 
tion as to why the monomers are catalytically inactive ( Joseph et al., 2007). 

Functional studies on the role of nspl5 in the coronavirus life cycle 
were initially performed in the TGEV replicon (Almazan et al., 2006) and 
HCoV-229E (Ivanov et al., 2004a) reverse genetic systems. An alanine 
replacement mutation of a TGEV catalytically active histidine residue 
reduced RNA synthesis of the TGEV replicon N RNA to levels below 
1% of those observed with the wild-type replicon. A mutation in a con¬ 
served aspartate that also abrogated NendoU activity of the HCoV-229E 
enzyme also prevented viral RNA accumulation when tested in the 
reverse genetic system, suggesting that nspl5 NendoU activity is required 
for viral replication. However, subsequent structural studies demon¬ 
strated that this particular mutation likely interfered with hexamer for¬ 
mation (Ricagno et al., 2006). A reverse genetic analysis of the related 
arterivirus nspll NendoU domain revealed a more complex pattern, with 
mutation of catalytic residues giving rise to viable viruses that were 
impaired for virus growth and directed decreased levels of viral RNA 
synthesis, particularly subgenomic RNA synthesis, whereas replacement 
of two conserved aspartate residues, one of which corresponded to the 
aspartate mutated in HCoV-229E (Ivanov et al., 2004a), rendered viral 
RNA synthesis and virus production undetectable (Posthuma et al., 
2006). Generally, similar results were obtained with the MHV reverse 
genetic system (Kang et al., 2007). Taken together the data suggest that 
NendoU enzymatic activity plays a role in facilitating maximal viral RNA 
synthesis, but it is not essential for coronavirus replication. The fact that 
mutations in a conserved aspartate residue required for nspl5 hexamer 
formation are lethal suggests that nspl5 has an as yet undiscovered 
essential role in coronavirus replication that is likely dependent on the 
formation of nspl5 hexamers. 
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Examination of betacoronavirus nspl5 amino acids sequence revealed 
that they contained a retinoblastoma (pRb)-binding motif (LXCXE/D) 
located on the surface of the protein near the NendoU active site 
(K. Bhardwaj, C.C. Kao et al., unpublished results). The addition of pRb 
to recombinant nspl5 stimulated endonuclease activity in vitro and the two 
proteins coimmunoprecipitated from cellular extracts. Expression of nspl5 
in cells shifted the cellular distribution of pRb toward the cytoplasm, 
increased ubiquitination of pRb, decreased pRb levels, increased the frac¬ 
tion of cells in the S phase of the cell cycle, and increased the number of foci 
of proliferating 3T3 cells in a transfection assay. Mutation of the LXCXE /D- 
motif in MHV resulted in a viable virus that exhibits a modestly reduced 
growth phenotype. Together these data suggest that nspl5 and pRb inter¬ 
act, and that this interaction alters the regulation of cellular proliferation 
and has subtle effects on coronavirus replication in culture. 

G. Nspl6 protein 

The nspl6 protein is encoded in the coronavirus lb orf and is synthesized 
as pplab precursor from which the mature nspl6 protein of approxi¬ 
mately 33 kDa is proteolytically released by the CoV 3CL pro protease 
(Hegyi and Ziebuhr, 2002; Prentice et al., 2004; Xu et al., 2001) [reviewed 
in (Ziebuhr, 2005)]. Using a comparative genomic approach, Snijder et al. 
identified a conserved S-adenosylmethionine-dependent ribose 2'-0- 
methyltransferase (2'-OMT) domain that contains the conserved K-D-K- 
E catalytic tetrad characteristic of the RrmJ family of methyltransferases 
(Decroly et al., 2008; Snijder et al., 2003). A biochemical analysis of recom¬ 
binant FCoV nspl6 demonstrated that nspl6 did carry the predicted 2'- 
OMT activity (Decroly et al., 2008). Recombinant nspl6 selectively binds 
short-capped RNAs that have previously undergone N7-methylation of 
the guanosine cap, a reaction that is carried out by nspl4 (see Section 
VI.E). Nspl6 then catalyzes the transfer of a methyl group from S-adeno- 
sylmethionine to the 2'-hydroxyl group of the first transcribed nucleotide, 
thereby converting a cap-0 structure to a cap-1 structure (Decroly et al., 
2008). Mutagenesis studies demonstrated that the presumptive catalytic 
amino acids were essential for significant enzymatic activity. Nspl6 has 
been shown to interact with nsplO, an RNA-binding protein (Pan et al., 
2008). Surprisingly, purified SARS-CoV nspl6 had virtually no 2'-OMT 
activity in the absence of nsplO; however, binding of SARS-CoV nspl6 to 
nsplO greatly enhanced 2'-OMT activity to levels higher than that 
observed with FCoV nspl6 (Bouvet et al., 2010; Decroly et al., 2011) and 
a mixture of nsplO, nspl4, and nspl6 proteins efficiently converted cap- 
0 containing RNAs to 2'-0-methylated cap-1 RNAs (Bouvet et al., 2010). 
Guided by the crystal structure of nsplO (Joseph et al., 2006) Lugari et al. 
were able to perform a series of studies of mutant recombinant SARS-CoV 
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nsplO to define the binding surface that interacted with nspl6 (Lugari 
et ah, 2010). Mutations that abolished the nspl0:nspl6 interaction also 
abrogated nspl6's 2'-OMT activity. Determination of the structure of the 
SARS-CoV nspl0:nspl6 heterodimer by X-ray crystallography enabled a 
series of biochemical experiments with nspl6 mutants that defined key 
nspl6 residues for binding nsplO (Decroly et ah, 2011). Studies with nspl6 
mutants also identified key residues for the highly specific binding of N7- 
methylated capped RNAs (Decroly et ah, 2011). Since nsplO also interacts 
with nspl4 as well as nspl6 (Pan et ah, 2008), and nsplO is a non-specific 
RNA-binding protein (Joseph et ah, 2006), it is likely that these three 
proteins act coordinately and we speculate that they may act as a multi- 
component complex. 

To better understand the biologic function(s) of nspl6, viruses-contain- 
ing mutations in nspl6 have been generated by forward and reverse genetic 
methods. A biochemical and genetic (complementation) analysis of a panel 
of MHV-A59 ts mutants identified two viruses with nspl6 mutations, and 
these viruses exhibited defects in RNA synthesis under nonpermissive 
conditions, implicating an important role for nspl6 in viral RNA synthesis 
(Sawicki et ah, 2005). Consistent with this result, a reverse genetic study 
employing a SARS-CoV replicon demonstrated reductions in RNA synthe¬ 
sis of 90% (Almazan et ah, 2006). However, a subsequent reverse genetic 
study with active-site mutants of HCoV-229E and MHV-A59 demonstrated 
that viruses-containing mutations that completely abrogated 2'-OMT activ¬ 
ity (D129A for HCoV-229E, D130A for MHV) were viable (Zust et ah, 2011). 
Although the HCoV-229E D129A mutant had slower growth kinetics and 
reached a peak titer about 10-fold lower than wild-type virus in MRC9 
fibroblasts, the MHV-A59 D130 mutant grew identically to wild type in 
17C1-1 cells, a transformed fibroblast cell line. Both the HCoV-299E D129A 
mutant and the MHV D130A mutant elicited consistently higher levels of 
interferon-)! than wild-type virus when infecting primary macrophages and 
the replication of both mutants was dramatically more sensitive to inter- 
feron-a treatment than wild-type virus. The vigorous induction of inter- 
feron-p by the MHV D130 mutant was completely dependent on MDA-5, 
suggesting that 2'-0-methylation interferes with the sensing of coronavirus 
RNA by MDA-5, the major sensor of coronavirus infection and induction of 
an interferon response in macrophages (Roth-Cross et ah, 2008). The 2'-OMT 
activity of nspl6 was also demonstrated to play a key role in avoiding the 
antiviral effect of IFIT-2 (ISG54) and IFIT-1 (ISG56), two members of the IFIT 
family of proteins that play an important role in the development of an 
antiviral state after interferon treatment (Daffis et ah, 2010; Zust et ah, 2011). 
Infection of C57B1 / 6 mice by intraperitoneal injection showed that although 
wild-type MHV-A59 replicated to high titer in liver and spleen, the nspl6 
D130A mutant failed to replicate and spread in wild-type mice (Zust et ah, 
2011). This phenotype was dependent upon an intact type I interferon 
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response; IFNAR— / — mice failed to restrict replication of the mutant. These 
data suggest that a major function of the nspl6 2'-OMT activity is to allow 
coronaviruses to evade restriction of viral replication by interferon. 


VII. MHV ACCESSORY PROTEINS 

The genomes of all coronaviruses have small accessory proteins, not 
essential for replication in cell culture. Such proteins, encoded in the 
genomes of other virus families, have been shown to have important 
functions in virus host interaction, many of which antagonize the host 
type I interferon response. Accessory proteins are distinct among the 
three coronavirus groups and are completely distinct between MHV 
and SARS-CoV, perhaps indicative of the early evolutionary split off of 
SARS-CoV from the other betacoronaviruses (Snijder et al., 2003), and 
another reason to place them in separate subgroups of betacoronaviruses. 
We will review the accessory proteins of MHV and SARS and their roles 
in antagonizing the host response. 

MHV encodes three such accessory proteins, ns2 (orf2a), ns4 (orf4) [or 
ns4a,4b in some strains (orf4,b)] and ns5a (orf5a). Early on it was shown 
that naturally occurring viruses not expressing ns2 (Schwarz et al., 1990) 
or ns4, 5a (Yokomori and Lai, 1991) were replication competent in vitro, 
confirming that expression of these proteins was nonessential for replica¬ 
tion in vitro in transformed cell lines (de Haan et al., 2002). 

A. ns2 protein 

The most well-studied MHV accessory protein, ns2 is a 30 kDa cytoplas¬ 
mic protein, encoded in orf2a, just downstream of the replicase gene and 
expressed from a distinct mRNA from HE (encoded in orf2b) (Schwarz 
et al., 1990; Zoltick et al., 1990). ns2 contains a domain with high homology 
to a superfamily of 2H phosphoesterases and was predicted to have a 1", 
2"-cyclophosphodiesterase (CPD) activity. The structure of ns2 has been 
predicted based on the cellular phosphoesterase AKAP18, and includes 
two His-x-Thr/Ser motifs for ns2 (Roth-Cross et al., 2009). While expres¬ 
sion of ns2 is nonessential for replication in tissue culture cell lines, it is 
necessary for A59-induced hepatitis in vivo. Introduction of amino acid 
substitutions into the predicted catalytic His residues (Roth-Cross et al., 
2009) attenuates A59 replication in the liver, and reduces hepatitis to 
a minimal level, without affecting viral replication in the brain or enceph¬ 
alitis (Roth-Cross et al., 2009). Thus ns2 is a tissue-specific virulence 
factor and in addition has been shown to antagonize type I interferon 
signaling. Replication of ns2 mutant viruses, in which either of the pre¬ 
dicted catalytic His residues has been replaced is attenuated in bone 
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marrow-derived macrophages (BMM) from wild-type (wt) mice but not 
in BMM derived from type I interferon receptor knockout (IFNAR—/—) 
mice, and in addition, ns2 mutants are more sensitive than wt virus to 
pretreatment of BMM. Consistent with these in vitro data, ns2 mutants 
replicate to nearly the same titers as wt virus after depletion of macro¬ 
phages in vivo (Zhao et al., 2011). Recently, we found that ns2 mutants 
replicate to wild-type titers in BMM isolated from RNase L deficient mice 
and we have obtained evidence, albeit indirect, that ns2, as predicted 
from sequence, has a phosphodiesterase activity (Mazumder et al., 2002) 
that cleaves 2-5A, the activator of RNase L and thus, most likely does not 
act as an IFN antagonist via a cyclophosphodiesterase activity. These data 
imply that the ability of MFIV to replicate in macrophages is a prerequisite 
for replication in the liver and the induction of hepatitis, but not for CNS 
replication and disease, highlighting the importance of IFN signaling in 
macrophages in vivo for the protection of the host from hepatitis. We 
suggest the Kupffer cells, macrophages of the liver, serve as a gateway 
to the live parenchyma and restrict viruses through their robust IFN 
signaling. These ns2 studies point out the cell type and organ type speci¬ 
ficity of the IFN response and the interaction of MHV with that response 
and furthermore, underscore the importance of studying virus-host inter¬ 
action in primary cells rather than or in addition to transformed cell lines. 

B. ns4 protein(s) 

The ns4 gene of the JHM strain of MHV encodes a 14 kDa protein that has 
not yet been detected in infected cells. Comparison of a recombinant JHM. 
IA ablated for ns4 expression with its isogenic wt parent demonstrated 
that ns4 was not essential for high neurovirulence (Ontiveros et al., 2001). 
The ns4 orf of A59 genome has a premature termination codon, convert¬ 
ing orf4 into two smaller orfs 4a and 4b; there are no data regarding 
expression of these proteins or a role in virulence for either one. Thus 
orf4 has been used a site for expression of foreign genes (Chua et al., 2004; 
MacNamara et al., 2005; Zhou and Perlman, 2006). It has, however, not 
been unambiguously shown that the gene products of orf4 do not have an 
as yet undetected role in virus host interaction, possibly in an organ 
specific way as has been observed for ns2. 

C. ns5a protein 

Orf5a encodes ns5a, a protein of approximately 13 kDa, encoded in an 
upstream orf on the same mRNA as the E protein (orf5b). In vitro transla¬ 
tion studies suggested that ns5b is translated from an internal ribosome¬ 
binding site (Budzilowicz and Weiss, 1987; Skinner et al., 1985) and orf5a 
was less efficiently translated than orf5b in an in vitro system and has not 
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yet been detected in infected cells. Orf5a has been reported to encode a 
type I IFN antagonist activity. A recombinant chimeric virus expressing 
the 3' end of MHV-A59 and the replicase gene of MHV-S was strikingly 
more sensitive to IFN pretreatment of L2 cells than parental A59, similar 
to that of mildly virulent MHV-S (Koetzner et al., 2010). An A59 mutant 
abrogated for expression of orf5a displayed intermediate sensitivity to 
type I IFN pretreatment of L2 cells, supporting the finding that orf5a is an 
IFN antagonist and also implying that other genes encoded in more 5' 
regions within the A59 genome contribute to IFN antagonist activity as 
well (Koetzner et al., 2010), consistent with the ns2 findings described 
above. Further unpublished studies supporting these data showed that 
the ns5a mutant was more sensitive than wt A59 in replication in BMM, 
but that replication is recovered in BMM derived from IFNAR—/— mice 
(data not shown). The virulence conferred by ns5 is through a different 
mechanism from that of ns2 in that the ns5a mutant was attenuated for 
replication in the CNS as well as in the liver (data not shown). Thus the 
mechanism by which ns5a confers IFN antagonism is not yet understood. 


VIII. SARS ACCESSORY PROTEINS 

The SARS-CoV genome encodes a number of accessory proteins with no 
identified homologies to those of MHV or other known host cell proteins. 
SARS-CoV accessory proteins are encoded in orfs 3a, 3b; 7a, 7b; 8a, 8b; 9b. 
The orf3b,7b, 8b encoded proteins are translated via internal downstream 
initiation codons from the same mRNAs as 3a,7a,8a, respectively. Of 
these, proteins encoded in orfs3a, 6, 7a, and 7b have all been found in 
virus particles (Narayanan et al., 2008b). Systematic deletions individually 
and in combination of orfs 3a, 3b, 6, 7a, or 7 in recombinant viruses 
demonstrated that none of these orfs is essential for replication in cell 
culture, demonstrating that like ns4 and ns5a of MHV, these are nones¬ 
sential proteins. However, there was some loss efficiency in replication 
particularly for a virus with the deletion of orf3a (Yount et al., 2005). 
Importantly, there are data, however, showing that the orf3b and 6 have 
type I interferon induction and signaling antagonizing activities while 
orfs3a and 7a have roles in interfering with signaling pathways including 
apoptosis, as described below. Functions for the proteins encoded in the 
other accessory orfs are as yet not known. 

A. orf6 

SARS-CoV orf6 encodes a 63-amino acid endoplasmic reticulum (ER)/ 
Golgi membrane-associated protein, which is expressed in cell culture and 
in the lung and intestines of infected patients (Narayanan et al., 2008b). 
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The orf6 encoded protein was shown to be a virulence factor; when 
expressed ectopically from within an MHV genome, orf6 protein con¬ 
ferred lethality upon a nonlethal JHM isolate (Tangudu et al., 2007). The 
orf6a protein was further demonstrated to function as a virulence factor 
within the SARS-CoV genome. The orf6 protein inhibits nuclear import 
and as such inhibits interferon signaling by preventing import of ISGF3 
(STAT1/STAT2/IRF-9), the transcription factor that mediates expression 
of type I interferon-stimulated genes or STAT1/STAT2 complexes follow¬ 
ing IFN-y treatment. The C-terminal tail of orf6 binds karyopherin alpha 
(KPNA)2, which recruits KPNB1 a component of the classical nuclear 
import complex, thus blocking proteins with classical import signals 
(Frieman et al., 2007; Hussain et al. r 2008). Interestingly Ebola virus, 
another human pathogen from a different virus family encodes VP24 
which inhibits host nuclear import, illustrating that this strategy is utilized 
by multiple viruses in modulating host responses to viral infection. When 
expressed in the absence of other viral proteins, orf6 induces the formation 
of membranous structures, similar to double membrane vesicles involved 
in virus replication and in addition partially colocalizes with nonstruc- 
tural protein 3 (nsp3) (Zhou et al., 2010), leading to the suggestion that orf6 
protein is also involved in virus replication. 

B. orf3b 

Several functions have been reported for orf3b encoded protein, which is 
expressed during infection of patients (Chan et al., 2005). Overexpressed 
orf3b protein was localized primarily to the nucleus in A549 cells in 
culture and was shown to inhibit both interferon induction and signaling 
(Kopecky-Bromberg et al., 2007). Other reports concluded that expressed 
orf3b also induced cell growth arrest (Yuan et al., 2005) or apoptosis and 
necrosis (Khan et al., 2006). Most of the data available regarding orf3b 
protein activities come from overexpression studies, and there is little 
information on the activities of orf3b during infection. 

C. orf7a 

The orf7a-encoded protein is a 122 amino acid type I transmembrane 
protein, localized to perinuclear regions in SARS-infected cells (Nelson 
et al., 2005) through interactions with M and E (Huang et al., 2006). The 
precise subcellular localization of the orf7a protein has been disputed (ER, 
ERGIC Fielding et al., 2006), trans Golgi (Nelson et al., 2005). Likewise, 
several biological activities have been reported for the orf7a encoded 
protein, including induction of apoptosis through a caspase-dependent 
pathway (Tan et al., 2004), inhibition of cellular protein synthesis, activation 
of p38 mitogen activated protein kinase (Kopecky-Bromberg et al., 2006) 
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and cell cycle arrest at Go/G1 (Yuan et ah, 2006b). As with the orf3a encoded 
protein, most of the data has been obtained from overexpression studies 
rather than from infected cells. 


IX. CONCLUSIONS AND FUTURE DIRECTIONS 

The coronavirus field, viral replication, cell biology and pathogenesis has 
advanced quickly in part due to the availability of reverse genetics sys¬ 
tems but also, in the past 7 or 8 years due to the increased interest in this 
class of viruses following the SARS epidemic and resources added to the 
study of these viruses. The impressive speed with which the SARS-asso- 
ciated virus was identified and the genome sequenced was made possible 
by the data accumulated previously on the other members of the corona¬ 
virus family, illustrating the value of basic science research. There are still 
some important and intriguing questions to be addressed about corona- 
viruses, a few of them being the following. 

What determines the varied organ tropisms among MHV strains? One 
of the remaining puzzling aspects of MuCoV pathogenesis is how MHV 
strains have different organ tropisms despite the observations that they 
all use the same cellular receptor, CEACAMla, and the lack of evidence 
for an alternative receptor. This is in part explained by the contributions 
of other virus genes and postviral entry events to the determination of 
tropism; however, the mechanisms underlying differential organ tropism 
are not at all understood. 

How the very different cell type and organ type specific levels of 
CEACAMla receptor expression influence tropism? While the liver 
expresses detectable levels of CEACAMla protein, CEACAM protein is 
undetectable in the brain and the levels of mRNA are expressed in the 
brain are approximately 100-fold less than in the liver. Neurons are the 
most frequently infected cell type and express levels of CEACAMla 
mRNA at or near the level of detection, yet despite this observation, the 
brain remains a major target of MHV infection. Thus there are still unan¬ 
swered questions regarding the requirements for CEACAMla expression 
and other potential receptors for efficient infection and spread of MHV 
in vivo. 

What are all the replicase proteins/activities for? The coronavirus 
replicase coding region is longer (approximately 21 kb) than most RNA 
viruses. Granted that the Coronavirus discontinuous mode of mRNA 
synthesis is more complex than that of other RNA viruses, the closely 
related Arteriviruses are also members of the Nidovirus family and 
synthesize their mRNAs by the same general mechanism but do so 
using much less genetic information. This raises the question as to what 
functionalities this additional genetic potential encodes. As exemplified 
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by nspl and nsp3, it is likely that at least some of this additional genetic 
potential is directed toward manipulation of the host environment 
to directly further virus replication or toward aiding immune evasion. 
A number of the nsps or domains within nsps (e.g., nsp3) have no 
biochemical activity associated with them, or if a biochemical activity 
has been demonstrated, an in vivo correlate of that activity has not yet 
been demonstrated. Unraveling these functions will continue to be fruit¬ 
ful area of coronavirus research. 

Will SARS or another HCoV emerge from its reservoir? The data 
suggest that SARS adapted to humans by only a few mutations into the 
viral spike proteins. It seems like this could happen again given the 
identification of numerous bat SARS-like viruses and the finding of 
SARS-like virus in animal such as the civet. 
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